THE IMPACT OF MANAGERIAL NONVERBAL CUES ON THE REACTIONS OF
SUBORDINATES


TIMOTHY  DEGROOT 


A DISSERTATION PRESENTED TO THE GRADUATE SCHOOL OF THE
UNIVERSITY  OF  FLORIDA  IN  PARTIAL  FULFILLMENT  OF  THE 
REQUIREMENTS  FOR  THE  DEGREE  OF  DOCTOR  OF  PHILOSOPHY 


UNIVERSITY OF FLORIDA


RESULTS

5 DISCUSSION

Interview Favorability
Interview Validity

REFERENCES

A RATING SCALES FOR INTERVIEW
B VISUAL NONVERBAL CUE RATING SCALES
C AUDIO CUE SAMPLE FILE
D PERSONALITY ATTRIBUTION SCALES
E PERSONAL REACTION SCALES

BIOGRAPHICAL SKETCH


Abstract  of  Dissertation  Presented  to  the  Graduate  School  of 
the  University  of  Florida  in  Partial  Fulfillment  of  the 
Requirements  for  the  Degree  of  Doctor  of  Philosophy 
THE  IMPACT  OF  MANAGERIAL  NONVERBAL  CUES  ON  THE  REACTIONS  OF 
SUBORDINATES 


Timothy  DeGroot 
December  1997 

Chairman:  Stephan  Motowidlo 
Major  Department:  Management 

This  study  examines  the  impact  of  managers'  nonverbal 
cues  on  interview  ratings  and  job  performance  ratings 
mediated  through  the  reactions  of  subordinates.  As  part  of 
an  interview  validation  study,  simulated  employment 
interviews  with  110  first-level  managers  were  recorded  on 
videotape.  Job  performance  ratings  were  gathered  from 

from  Human  Resource  professionals  within  the  firm.  Visual 
nonverbal  cues  were  judged  by  trained  student  raters.  Audio 
nonverbal  cues  were  objectively  measured  by  feeding  the 
audio portion of the videotapes into a computer voice
analyzer.


Two  mediating  variables  are  examined:  raters' 
attribution  of  managers'  personality  and  raters'  personal 
reaction  toward  the  manager.  These  data  were  collected  in  a 
laboratory  study  using  student  raters.  Raters  were  exposed 

the visual or the auditory channel. They either made
attributions of the managers' personality based on the Big
Five dimensions or gave their personal reactions to the
manager. Personal reactions were based on 14 scale items
asking such things as "how much do you think you would like
this manager" and "how much would you trust this manager."

Audio cues were significantly correlated with interview
ratings, job performance ratings, personal reactions, and
extroversion attributions. Visual cues were significantly correlated
with  interview  ratings,  job  performance  ratings, 
extroversion  attributions,  conscientiousness  attributions, 
and  agreeableness  attributions.  Mediation  tests  found 
personal  reactions  and  conscientiousness  attributions  to  be 
causal  mechanisms  in  the  model  for  visual  cues  and  interview 

conscientiousness  attributions  were  all  mechanisms  in  the 
model for audio cues and interview ratings. Finally,
personal reactions mediated the relationship between audio
cues and job performance ratings.


The effects of all variables on the validity of the
interview were also examined. Significant variance was
accounted  for  by  nonverbal  cues,  especially  audio  cues.  An 
examination  of  similar  usage  of  nonverbal  cues,  personality 
attributions,  and  personal  reactions  by  interview  raters  and 
job  performance  raters  using  a Brunswik  lens  model  showed 
high  common  usage  between 


What determines interview performance ratings? Those
who provide such ratings would like to think that the
content of answers to interview questions determines the
outcome of subsequent ratings. It is plausible, however,
that interview answer content is augmented by visual and
auditory information, that is, by nonverbal cues.

Research on nonverbal behavior has found a significant
main effect on interview ratings (e.g., Rasmussen, 1984).
Often, researchers assume nonverbal behavior has a biasing
effect on interview ratings. It has been shown that
interviewees who use high levels of nonverbal cues are
favored (McComb & Jablin, 1984). It has also been argued
that when nonverbal cues used by interviewers are also
correlated with job performance, those cues legitimately
belong in the true score domain (Motowidlo, 1986). Thus,
nonverbal cues might enhance interview validity or they may
be sources of error (Burnett & Motowidlo, 1994).

In  an  employment  interview,  the  interviewer  has  access 
to  information  about  past  job  behavior  through  the  content 
of answers. The interviewer can also observe impression
management behavior used by the interviewee in an attempt to

impression  management  behaviors  reflect  the  desire  to  land 
the  job  and  do  not  reflect  actual  intentions  of  job 
performance,  the  possibility  exists  for  inaccurate  ratings 
of  interview  performance.  Another  argument  could  be  that 
interviewers  are  obtaining  an  immediate  sense  of  the 
interviewee  established  by  visual  and  vocal  cues  that  may  be 
valid.  That  is,  the  content  of  interviewee  impression 
management  behavior  may  not  reflect  future  job  performance, 
but  the  cues  themselves  may. 

Valid predictions can be made from a combination of
behavioral and nonbehavioral nonverbal cues (Burnett &
Motowidlo, 1994). Furthermore, judgments based only on
aural cues correlate substantially with the same judgments
based on only visual cues in the interview (Motowidlo &
Burnett, 1995). Possibly, traits related to managerial
effectiveness  can  be  accurately  detected  through  visual  cues 
or  aural  cues  alone.  In  structured  interviews,  the 
interviewer  is  trained  to  ignore  these  cues  because  they  are 
assumed  to  cause  rating  error.  However,  considering  that 
the  cues  are  present,  they  are  probably  being  attended  to 
and  used  in  making  ratings.  For  the  job  performance  rater. 


relationship 


ratings  may  actually  indicate  cue  validity. 

This research examines the relationships among both
types of nonverbal cues (visual and audio), interview ratings,
and job performance ratings. A mediating model is offered
with  causal  variables  that  may  explain  the  relationship 
between  nonverbal  cues  and  both  types  of  ratings.  Then, 
interview  validity  is  examined  after  partialling  out  the 
variance  from  nonverbal  cues.  Finally,  a Brunswik  lens 
model  is  used  to  assess  the  degree  to  which  interview  raters 
and  job  performance  raters  similarly  use  nonverbal  cues, 
personality  attributions,  and  personal  reactions  to  make 
their  ratings. 


CHAPTER  2 
LITERATURE  REVIEW 

Research Model

This  research  examines  effects  of  nonverbal  cues  on 
interview  ratings  and  ratings  of  job  performance.  However, 
research  examining  the  effect  of  nonverbal  cues  on  interview 
ratings  has  not  offered  an  empirical  explanation  for  the 
relationship.  Thus,  this  study  attempts  to  discover  why 
different  cues  are  related  to  interview  ratings,  and  even 

hypothesized  mediating  model  that  will  be  tested  to  explain 
the  relationships  between  nonverbal  cues  and  performance 
ratings,  both  interview  and  job  performance. 

Two  variables  are  hypothesized  to  mediate  the  nonverbal 
cue — interview  rating  (job  performance  rating)  correlation. 
At  the  top  of  the  model  is  "personal  reactions  to  ratee." 
Personal  reactions  are  the  way  a person  affectively  feels 
toward  a speaker.  Items  asking  such  things  as  "how  much  do 
you  think  you  would  like  this  manager"  and  "how  much  would 
you  trust  this  manager"  reflect  what  is  meant  by  personal 


IMPACT  OF  NONVERBAL  CUES 
ON  PERFORMANCE  RATINGS 


the  effects  of  personal 


controlled, the relationship between nonverbal cues and
ratings drops, personal reactions to the speaker will be a
partial explanation for the correlation between nonverbal
cues and ratings.

Similarly, if the variance from personality
attributions of the ratee is partialled out and the
correlation drops for cues and ratings, this variable will
be a partial explanation. Personality attributions are
assessments of personality traits made by a person about the
speaker. In this case, attributions of managers'
personality traits are made by raters trained to act like
subordinates. To describe the links shown in the model, a
review of the literature on each component follows.
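
The partialling logic described above can be illustrated with a first-order partial correlation. The following is a minimal sketch for illustration only, not the analysis procedure used in this study; the function name and the example correlations are hypothetical.

```python
import math


def partial_correlation(r_xy: float, r_xz: float, r_yz: float) -> float:
    """First-order partial correlation between x and y, controlling for z.

    x = nonverbal cue score, y = rating, z = candidate mediator
    (e.g., personal reactions). These role labels are assumptions
    made for this illustration.
    """
    denominator = math.sqrt((1.0 - r_xz ** 2) * (1.0 - r_yz ** 2))
    if denominator == 0.0:
        raise ValueError("z is perfectly correlated with x or y")
    return (r_xy - r_xz * r_yz) / denominator


# Hypothetical correlations, chosen only to show the computation.
r_cue_rating = 0.50       # cue with rating
r_cue_reaction = 0.50     # cue with personal reaction
r_reaction_rating = 0.50  # personal reaction with rating

r_partial = partial_correlation(r_cue_rating, r_cue_reaction, r_reaction_rating)

# A positive drop means the mediator accounts for part of the
# cue-rating correlation.
drop = r_cue_rating - r_partial
print(f"zero-order r = {r_cue_rating:.2f}, partial r = {r_partial:.2f}")
```

If the partial correlation is clearly smaller than the zero-order correlation, the controlled variable is a candidate partial explanation in the sense used above.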

Dependent  Variables 

In  this  study,  two  dependent  variables  are  studied: 
interview  ratings  and  job  performance  ratings.  A review  of 
the  literature  for  each  follows. 

Interview Ratings

practitioners  to  rely  less  upon  the  interview  as  a sole 
selection  device  because  its  predictive  validity  is  not  as 
high  as  other  predictors.  Though  ubiquitous  in 
organizations, the interview suffered from the effects of
the "received doctrines": poor validity and differential
validity due to idiosyncratic interview processes (Barrett,
1972). However, recent research suggests that the interview

Cronshaw, 1988; Harris, 1989). Specifically, basing
structured interview questions on a job analysis has
dramatically improved validity. Much work has focused on
developing  different  types  of  structured  interviews  in 
attempts  to  increase  predictive  validity.  What  might  be 
more  important  now  is  to  search  for  specific  variables 
involved  in  the  structured  interview  that  affect  the 
favorableness  of  interview  ratings  and  ultimately, 


Unstructured  Interviews 

By  their  very  nature,  unstructured  it 
construed  as  situations  characterized  by  uncertainty  and 
ambiguity  (Kacmar,  Delery,  & Ferris,  1992) . This  type  of 
interview  works  without  a predetermined  s 
usually  without  specific,  behaviorally  a 
scales.  Interviewers  ask  what  they  feel 
questions,  then  use  their  ability  to  mak 
from  among  the  candidates.  Obviously,  a 
variance  is  inherent  in  this  style.  An 
s considered  many  variables  t 


questions  a 
pertinent 


moderating  effects  on  unstructured  interview  validity. 


Individual  Differences 

Compounding  the  problem  of  different  validities  for 
interview  formats  is  the  issue  of  interviewer  individual 
differences.  Even  if  the  most  valid  format  is  used, 
something  about  the  interviewer  could  skew  true  scores. 

That  is,  some  interviewers  will  be  more  valid  than  others. 
Therefore,  if  the  most  valid  interview  format  can  be  found, 
and  the  best  interviewers  are  selected,  the  possibility 
exists  that  training  can  occur  to  make  them  equally  valid. 

Some empirical research on the effectiveness of various
interview training programs has occurred. Dougherty, Ebert,
and Callender (1986) provided strong support for the thesis
that  some  interviewers  are  more  valid  than  others.  They 
trained  interviewers  to  elicit  useful  information  and  use  it 
to  rate  interview  dimensions.  What  they  found  were  large 
increases  in  the  validity  of  interviewer  judgments.  Maurer 
and  Fay  (1988)  conducted  an  experiment  where  training 

note  taking  and  rating  errors  like  halo.  However,  they 
found  no  effect  of  the  treatment  on  interrater  reliability. 
Conversely,  Fay  and  Latham  (1982)  found  training  to  combat 
rating  errors  actually  decreased  them. 

The role of demographic variables for predicting
interview outcomes and interview ratings appears more modest
and limited than commonly believed (Raza & Carpenter, 1987).


review  (1982),  Harris  (1969)  summarized  studies  done  on  the 
concluded  that  recent  research  has  found  little  relationship 

Campion and Arvey (1989) noted that fewer than 1% of
litigation in fair employment practice issues between 1979
and 1987 had anything to do with the selection interview.
Nonetheless, legal problems in employee selection are always
a salient issue. They summarized four areas that could be
legally problematic in the interview: 1) preinterview
deterrence, or prescreening of applicants; 2) training and
experience of interviewers; 3) prohibited interviewer
behavior; and 4) subjectivity in evaluation. Employers are
urged to pay close attention to these specific areas to
enhance legal defensibility of the interview (Campion &
Arvey, 1989).

Confirmatory  bias,  when  people  attempt  to  confirm  their 
hypotheses  about  phenomena,  is  a cause  of  individual 
differences  among  interviewers  and  is  always  suspect  in 
social  psychology  research.  However,  in  studies  in  an 
interview  context,  little  evidence  is  found  of  this 
potential problem (e.g., Sackett, 1982; McDonald & Hakel,
1985). It seems that confirmatory bias occurs only under a

context (Harris, 1989). Possibly, interviewers do not make
first  impressions  of  applicants  that  they  try  to  confirm 
throughout  the  interview.  Perhaps  it  simply  does  not 
greatly  affect  final  judgments. 

Mood of the interviewer has been examined as a possible
moderator of interview ratings (Baron, 1987). Baron
predicted that interviewers would assign higher ratings and
reach more favorable employment decisions about interviewees
when in a positive rather than a negative mood. His
results  confirmed  this  prediction  and  supported  the  previous 
finding  that  mood  often  exerts  significant  effects  upon 
memory  (e.g.,  Forgas  et  al.,  1984). 

Another  possible  effect  on  the  validity  of  unstructured 
interviews  comes  from  coaching  for  preparation  of  what  might 
be asked in unstructured interviews. This type of biasing
activity occurs especially in campus recruiting at
universities, where interviewees are provided with socially
desirable answers to use in the interview in an effort to
use impression management to land the job. Obviously,
inappropriately  influenced  interviewer  judgments  may  have  no 
correlation  with  the  individual's  potential  performance  at 
all. For example, Kacmar et al. (1992) conducted a study on
impression management techniques used in interviews. They
found that higher ratings would be given to an applicant who
used self-focused impression management techniques than to
an applicant who used other-focused tactics. Self-focused
techniques  maintain  attention  on  the  candidate;  the 
interviewee  gives  up  being  the  focus  of  attention  and 
instead  employs  more  subtle  mechanisms  of  influence  when 
using  other-focused  tactics  (Kacmar  et  al.,  1992).  This 

were  used,  the  applicant  received  more  job  offers  and  fewer 
rejections  than  the  group  using  the  other-focused  tactics. 

More recent research provided robust evidence that
these earlier findings on increased usage of impression
management behaviors resulting in more job offers may
generalize beyond the laboratory (Stevens & Kristof, 1995).
That is, the positive relationship between applicants'
impression management tactics and their actual interview
outcomes was confirmed in a field setting (Stevens &
Kristof, 1995). However, Baron (1989) found that there can
be a "too much of a good thing" effect with impression
management. Over several studies, he found that impression
management techniques worked to an extent, then began to
have negative effects on ratings (Baron, 1989).

computer  administration  of  questions  might  help  to  avoid 
social  desirability  responses.  Martin  and  Nagao  (1989) 
examined  effects  of  different  administrative  modes  on 


interviewee 


socially  desirable  responding 


computer administration. However, Lautenschlager and
Flaherty (1990) found that computer administration led to
increases in the impression management components of
observed questionnaire scores. Clearly, removing the
interviewer from the process does not seem to solve the
problem of bias caused by impression management.

, these  findings  make  it  important  for  the 


interviewer  to  be  aware  of  the  polished  interviewee.  The 
interviewer  may  believe  he  or  she  is  hiring  the  best 
qualified  candidate,  but  is  really  selecting  the  image 
portrayed  by  the  applicant  (Kacmar  et  al.,  1992).  Managing 
the  impressions  of  an  experienced  interviewer  may  be  more 
difficult,  but  since  all  applicants  conceivably  approach  an 
unstructured  interview  in  a similar  manner  (i.e.,  to  manage 
impressions  with  a self-focus) , all  interviewers  expect  the 
same  type  of  behavior  in  the  interview.  This  leaves  room 


for  the  well-polished  interviewee  to  sway  the  interviewer 
with  a modified  approach  to  "winning  over"  the  interviewer, 
despite  that  interviewer's  experience. 

Hence, the low validity coefficients found for the
unstructured format should not be unexpected, given any or
all of the above threats. Based on results from their
meta-analysis, Wiesner and Cronshaw (1988) call for a
moratorium on unstructured interview research and suggest
that researchers should focus on structured interviews. Thus,
many researchers have examined the effects of changing the
format of the interview to a more structured approach in an
effort to increase its validity.

Structured  Interviews 

There is evidence that structuring the employment
interview will decrease the impact of the above-described
potential biases of the interviewer and raise the validity
of the interview (e.g., Campion, Pursell, & Brown, 1988;
Janz, 1982; Latham et al., 1980; Motowidlo et al., 1992).
However, to pinpoint exactly what in structured interviews
has caused this change is difficult. Structure is
multidimensional and continuous in nature (Dipboye &
Gaugler, 1993). Typically, it involves the consistent
application of predetermined rules. By applying these
rules, the interviewer is relieved of having to decide what
to ask while standardizing questioning across all
interviewees. This forces the interviewer to ask the same
questions, thus lowering cognitive demands on the
interviewer (Maurer & Fay, 1988). Seemingly, the only

interviewer  that  could  disturb  veteran  interviewers  who  ask 
questions  "by  feel." 


To differing degrees, the structured interviewing
technique controls for problems inherent in unstructured
interviews. Yet it is the increased validity attached to
structured interviews that is of particular interest, though
we do not know why structured interviews are more valid than
unstructured interviews. Dipboye and Gaugler (1993) discuss
distinct features of structured interviews that serve as
potential sources of predictive validity. First, one reason
that structured interviews are predictive might be that they
assess job-related samples of applicant behavior (Dipboye &
Gaugler, 1993). To the extent that applicant statements of
past behavior are honest and free from social desirability
biases, the descriptions of behaviors elicited from
structured interviews reveal personal characteristics, thus
increasing accurate judgment of the interviewee.

Another  possible  reason  for  the  success  of  structured 
interviews  is  the  attention  the  interviewer  pays  to 
diagnostic  information  about  the  applicant,  allowing  the 
interviewer  to  map  judgments  onto  underlying  attributes  more 
accurately (Dipboye & Gaugler, 1993). Social cognition

interview types: the coding and retrieving of information
and the tendency to forget information by the interviewer
(Wyer & Srull, 1989). With most structured formats,
notetaking is an integral part of gathering information.


Notetaking  has  been  shown  to  reduce  error  due  to 
the  recall  of  information  by  the  interviewer  (Carlson  et 

Another validity-increasing feature of the structured
interview is that it uses standardized scoring procedures,
while unstructured interviews are subjectively scored.
Scoring is best done on behaviorally anchored rating scales
that describe poor, average, and good performance (Maas,
1965). Systematic job analysis enhances the validity of
structured interviews (Wiesner & Cronshaw, 1988), and is
used in most structured interview forms presently available.

Furthermore, decomposed rating procedures yield higher
quality judgments than the holistic judgments typically made
in unstructured interviews (Einhorn & Hogarth, 1975). Indeed,
mechanical combinations of these decomposed ratings also may
add validity to a structured interview (Campion, Pursell, &
Brown, 1988; Harris, 1989). All factors seem to lead toward
a general theme of standardization. Still, it is impossible
to conclude that standardization is solely responsible for
the success of structured interviews. Differences in some
interview features may account for a large portion of the
variance in predictive validities among alternative formats
(Wiesner & Cronshaw, 1988).


Situational  Interview 


The situational interview is based on a systematic job
analysis known as the critical incident technique (Latham et
al., 1980). When an incident is deemed critical to a
specific job, a question is developed to find out how an
applicant would behave in that situation. This format is a
form of low-fidelity simulation (Motowidlo, Dunnette, &
Carter, 1990; Motowidlo & Tippins, 1993) because it presents
job-like  situations  and  asks  applicants  what  they  would  do 
in  those  situations.  The  interview  is  conducted  by  a panel 
of  two  or  more  people.  Answers  are  scored  with  a highly 
structured  scoring  guide. 

Latham et al. (1980) argue that this type of
interview is the only one grounded in theory, that being
goal setting theory (Locke, 1968). The theory states that
intentions are related to behavior. Latham et al. make the
assumption that what interviewees say in an interview
translates directly to what they would do on the job.

Patterned Behavior Description Interview (PBDI)

The PBDI follows the folk psychology phrase, "the best
predictor of future behavior is past behavior." As a basis
for the PBDI, four key types of interview information are
assumed to be present (Janz, 1989). They are credentials,
experience descriptions, opinions, and behavior
descriptions. The PBDI relies heavily on behavior
descriptions because they reveal specific choices the
applicant made in the past. Janz et al. (1986) suggest that
the use of behavior descriptions measures typical rather

validity associated with the PBDI.

Similar to the Situational Interview, the PBDI begins
with a collection of critical incidents, using the incidents
to suggest specific times in the applicant's experiences
that are samples of critical behavior on the job (Janz,
1982). Patterning consists of assembling questions to guide
the interview through these specific incidents. A typical
PBDI question would ask, "what did you do WHEN..." to get at
specific behaviors in which the applicant has engaged (Janz
et al., 1986). The interviewer does not have to ask the
same questions of everyone. For example, if it appears an
applicant has no relevant job experience, the pattern of
questions avoids relevant job experience. From the
applicant's past behavior, this format will best predict
future performance (Janz, 1982).

however,  small  sample  size  has  been  a problem  in  all  cases. 
Regardless,  statistically  significant  validity  coefficients 
have  been  around  .50  in  the  few  studies  done.  Orpen  (1985) 
reported predictive validities of .61 and .48 for the PBDI
versus .08 and .05 for the unstructured interview. A
problem  with  this  format  is  the  lack  of  a scoring  guide. 
Also,  many  patterns  can  occur,  all  of  which  would  have  to  be 
validated  for  direct  comparison  to  other  formats. 


Proposed  by  Campion,  Pursell,  and  Brown  (1988),  this 
technique  is  highly  structured.  It  is  constructed  by 
following  these  six  steps:  develop  questions  based  on  job 
analysis,  pose  these  questions  to  all  candidates  for 
employment,  anchor  the  rating  scales  for  scoring  the 
answers,  use  an  interview  panel  to  rate  answers, 
consistently  administer  the  process,  and  give  special 
attention  to  job  relatedness,  fairness,  and  documentation  in 
accordance  with  testing  guidelines.  In  effect,  this  style 
combines  all  characteristics  from  both  the  Situational 
Interview  and  the  PBDI. 

Following these developmental procedures, Campion,
Pursell, and Brown (1988) built a 20-item interview. In
their  study  of  it,  this  interview  had  an  internal 
consistency  reliability  of  .72.  Their  reported  validity 
coefficient  between  the  interview  and  performance  appraisal 
was  .34  uncorrected,  and  .56  corrected  for  range  restriction 
and  criterion  unreliability. 
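
Corrections of this kind are commonly computed with the classical disattenuation formula and Thorndike's Case II formula for direct range restriction. The sketch below shows those two standard formulas only. The source does not state which formulas, reliability estimate, or standard deviation ratio Campion et al. used, so the input values and the order of the corrections here are hypothetical assumptions.

```python
import math


def correct_for_criterion_unreliability(r_observed: float,
                                        criterion_reliability: float) -> float:
    """Disattenuate a validity coefficient for unreliability in the criterion."""
    if not 0.0 < criterion_reliability <= 1.0:
        raise ValueError("criterion_reliability must be in (0, 1]")
    return r_observed / math.sqrt(criterion_reliability)


def correct_for_range_restriction(r_restricted: float, sd_ratio: float) -> float:
    """Thorndike Case II correction for direct range restriction.

    sd_ratio = unrestricted predictor SD / restricted predictor SD.
    """
    if sd_ratio <= 0.0:
        raise ValueError("sd_ratio must be positive")
    numerator = r_restricted * sd_ratio
    denominator = math.sqrt(1.0 + r_restricted ** 2 * (sd_ratio ** 2 - 1.0))
    return numerator / denominator


# Hypothetical inputs: only the observed .34 comes from the text above.
observed_validity = 0.34
assumed_criterion_reliability = 0.60  # assumption, not reported in the source
assumed_sd_ratio = 1.30               # assumption, not reported in the source

# Applying unreliability first and range restriction second is also an assumption.
step_one = correct_for_criterion_unreliability(observed_validity,
                                               assumed_criterion_reliability)
corrected = correct_for_range_restriction(step_one, assumed_sd_ratio)
print(f"observed = {observed_validity:.2f}, corrected = {corrected:.2f}")
```

Both corrections raise the estimate, which is why a corrected coefficient is larger than the uncorrected one.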

The large correlations in this study may suggest that
the interview has a substantial job knowledge or cognitive
ability component (Campion et al., 1988). This leads to
equating the CSI to an orally executed cognitive ability
test. It seems the more structure in the interview, the
larger the correlation with cognitive ability tests. Given
the predictive power and low cost of these tests, one might
question the use of the interview at all. Campion et
al. (1988) performed incremental validity analysis that
revealed the interview added little additional explained
variance over the aptitude tests. Conversely, the aptitude
tests added additional validity to the interview alone.
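
Incremental validity of this kind is typically expressed as the gain in squared multiple correlation when one predictor is added to another. The sketch below uses the standard two-predictor formula for R-squared. It is an illustration under assumed values, not a reproduction of Campion et al.'s analysis; the function names and example correlations are hypothetical.

```python
def r_squared_two_predictors(r_y1: float, r_y2: float, r_12: float) -> float:
    """Squared multiple correlation of a criterion with two predictors.

    r_y1, r_y2: correlations of each predictor with the criterion.
    r_12: correlation between the two predictors.
    """
    if abs(r_12) >= 1.0:
        raise ValueError("predictors must not be perfectly correlated")
    return (r_y1 ** 2 + r_y2 ** 2 - 2.0 * r_y1 * r_y2 * r_12) / (1.0 - r_12 ** 2)


def incremental_validity(r_y_base: float, r_y_added: float,
                         r_between: float) -> float:
    """Gain in R-squared from adding a second predictor to a baseline predictor."""
    return r_squared_two_predictors(r_y_base, r_y_added, r_between) - r_y_base ** 2


# Hypothetical correlations, chosen only to show the computation.
r_test_criterion = 0.50       # aptitude test with performance
r_interview_criterion = 0.30  # interview with performance
r_test_interview = 0.60       # test with interview

gain_from_interview = incremental_validity(
    r_test_criterion, r_interview_criterion, r_test_interview)
gain_from_test = incremental_validity(
    r_interview_criterion, r_test_criterion, r_test_interview)
print(f"interview adds {gain_from_interview:.3f}, test adds {gain_from_test:.3f}")
```

With these assumed values the interview adds essentially nothing beyond the test while the test adds to the interview, which is the asymmetric pattern described above.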


Motowidlo  et  al.  (1992)  describe  this  format  as 
borrowing  from  both  the  patterned  and  situational  styles. 
Its  characteristics  include  the  following: 

1)  it  is  based  on  a critical  incident  job  analysis; 

2)  it  is  organized  around  behavioral  dimensions 
discovered  in  the  job  analysis; 

3)  it  uses  a standard  set  of  questions  about  how 
applicants  handled  past  situations  that  are  like 
situations  that  might  happen  on  the  job  and  that  might 
elicit  behavior  representing  an  interview  dimension; 

4)  the  interviewer  asks  discretionary  probing  questions 
for  details  of  the  situation,  the  interviewee's 


behavior 


interviewer 


progresses ; 

6)  after  performing  all  the  above  procedures  and 
reviewing  his  or  her  notes,  the  interviewer  rates  the 
applicant  on  behaviorally  anchored  scales;  and 

7)  the ratings are combined as an unweighted sum to yield
a total interview score to guide selection decisions
(Motowidlo et al., 1992).
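
The unweighted combination in the final step can be sketched as follows. The dimension names and the rating values are hypothetical and are used only to illustrate the arithmetic.

```python
def total_interview_score(dimension_ratings: dict[str, float]) -> float:
    """Combine behaviorally anchored dimension ratings as an unweighted sum."""
    if not dimension_ratings:
        raise ValueError("at least one dimension rating is required")
    return sum(dimension_ratings.values())


# Hypothetical dimensions and ratings for one applicant.
ratings = {"leadership": 4, "planning": 3, "communication": 5}
score = total_interview_score(ratings)
print(f"total interview score = {score}")
```

Because every dimension carries the same weight, no dimension can dominate the total through an interviewer's idiosyncratic weighting.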

This  format  is  more  structured  than  the  patterned  style 
because  interviewers  ask  the  same  questions  of  all 
applicants.  However,  it  is  similar  to  the  patterned  style 
since  it  emphasizes  questions  that  elicit  descriptions  of 
past  behavior  instead  of  hypothetical  situations.  Use  of 
the  behaviorally  anchored  rating  scales  is  perhaps  the  best 
attribute  borrowed  from  the  Situational  Interview.  Other 
strengths  are  its  "discretionary"  component,  that  is, 
allowing  for  the  interviewer  to  decide  when  and  how  to  probe 
for  additional  information,  and  the  notetaking  dimension  to 
avoid  interviewer  memory  problems.  This  format  seems  to  use 
all  the  positive  aspects  of  its  predecessors. 

Support  is  evident  for  the  reliability,  criterion- 
related  validity,  and  construct  validity  of  the  SBI 
(Motowidlo  et  al.,  1992).  The  SBI  was  tested  on  a sample  of 
approximately  500  interviewees,  as  opposed  to  the  PBDI  that 
was  tested  using  39  interviewees.  Results  from  a series  of 


studies by Motowidlo and his associates yielded an
uncorrected validity of .22. Further, the interview's
validity can be attributed to its behaviorally structured
format, and interviewers who follow the format most closely
make the most accurate judgments about interviewees
(Motowidlo et al., 1992).

Summary: Interview Ratings

The preceding review of the interview literature shows
that structured interviews allow the best chance at avoiding
rater biases, impression management, and other forms of
interview rating errors. To the extent that interview
ratings are made using the facets of structured interviews,
the effects of nonverbal cues on interview ratings can be
attributed to non-artifactual

Job Performance Ratings

Job performance is typically measured with a
performance appraisal. Performance appraisal is the process
by which an observer, usually a direct supervisor, rates the
job performance of a subordinate. The first problem is
determining what performance is. The supervisor must choose
whether to attempt to rate the behaviors observed on the job
or to try to quantify the outcome of the subordinate's work.
This is a problem of judgment of performance versus
measurement of performance.


If judgment of performance is deemed appropriate, one
must deal with idiosyncratic processes used by different
raters. The cognitive process used to judge performance is
given in a model described by DeNisi, Cafferty, and Meglino
(1984). There are six steps:

1)  observe behavior;

2)  form some cognitive representation of observed
behavior;

3)  store  the  representation  in  memory; 

4)  retrieve  stored  information  to  make  judgments; 

5)  integrate  this  information  with  other  information 
available;  and 

6)  render  evaluations  using  a suitable  instrument. 

Slovic and Lichtenstein (1971) concluded that judges have
difficulty weighting and combining information, causing them
to resort to simplified decision strategies.

Potential problems exist at each point of this process,
but evaluative integration, where one must recall behavior
and weight it (Ilgen & Feldman, 1983), is a main problem.

One cause of this problem may be that social and situational
constraints on supervisor ratings are present (Judge &
Ferris, 1993). In this research, a link between supervisor
inference of a subordinate's self-rating and the
supervisor's rating was found. This may indicate a sense of
accountability to subordinates or another social reason.
Researchers can minimize these problems by gathering
performance appraisals made exclusively for research
purposes.

Measurement of Performance

Measurement of performance is the other problem in
performance appraisal. To adequately measure performance,
one must use proper methods and have access to a valid input
of performance that samples the entire performance domain.
Since few firms have direct outcome measures suitable for
overall performance measures, performance appraisals are
accepted as the best surrogate measure of performance. One
advantage of supervisory ratings of performance is their
flexibility. Performance can be indexed to any dimension
for which a definition can be articulated and a rating scale
designed (Borman, 1991). Other advantages are that ratings
are not artificial, as a work sample is, and that they can
avoid contamination and deficiency (Borman, 1991). These
issues arise when trying to evaluate these ratings for
usefulness.

Three criteria are used to evaluate the usefulness of
performance appraisal data: rating errors, accuracy, and
psychometric properties. Errors such as halo, leniency,
range restriction, and distortion can occur; thus, they can
be used to evaluate performance appraisals. Halo is shown
by the intercorrelations among dimension ratings. Leniency
is a shift of mean ratings away from a scale midpoint.
Range restriction is a measure of the extent to which a
distribution clusters around a central point.
Distortion represents the inappropriate weighting of
dimensions. Though these indices are often used to assess
the usefulness of performance appraisal data, there is some
question whether they really are rating errors (Cooper,
1981; Murphy & Cleveland, 1991).

A meta-analysis performed by Murphy and Balzer (1989)
showed no strong correlation between errors and accuracy.
They reviewed 10 separate studies and found an average
correlation of -.05 (corrected to -.06). They conclude that
rater errors, if anything, seem to contribute to accuracy.
Becker and Cardy (1986) found that statistically controlling
for halo did not improve the validity of the ratings. They,
too, conclude there may be some truth in the effects of halo.
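
The rating-error indices defined above can be illustrated with a small computation. The following is only a sketch under stated assumptions: the function names, the ratee-by-dimension layout of the ratings, and the choice of the standard deviation of ratees' overall scores as the range-restriction index are illustrative and are not taken from the studies cited.

```python
from itertools import combinations
from math import sqrt
from statistics import mean, pstdev


def pearson_r(x, y):
    """Pearson correlation between two equal-length sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    ss_x = sum((a - mx) ** 2 for a in x)
    ss_y = sum((b - my) ** 2 for b in y)
    return cov / sqrt(ss_x * ss_y)


def halo(ratings):
    """Mean intercorrelation among dimension ratings (columns)."""
    dimensions = list(zip(*ratings))  # one tuple per dimension
    pairs = list(combinations(dimensions, 2))
    return mean([pearson_r(a, b) for a, b in pairs])


def leniency(ratings, midpoint):
    """Shift of the mean rating away from the scale midpoint."""
    return mean([value for row in ratings for value in row]) - midpoint


def range_restriction(ratings):
    """Spread of ratees' overall scores; smaller means tighter clustering."""
    return pstdev([mean(row) for row in ratings])


# Hypothetical data: 3 ratees (rows) rated on 3 dimensions, 7-point scale.
example = [[3, 4, 5], [4, 5, 6], [5, 6, 7]]
print(halo(example), leniency(example, midpoint=4), range_restriction(example))
```

In this toy matrix every dimension orders the ratees identically, so the halo index is at its maximum. Consistent with the question raised in the text, such a pattern alone cannot show whether the intercorrelation is an error or a true feature of performance.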

Accuracy is widely used to understand the usefulness of
performance appraisal data because it requires both the
correct rank order and correct absolute level against target
scores (Borman, 1991). However, many measures of accuracy
exist. Cronbach (1955) broke down difference scores between
overall ratings and true scores into four components:

1) elevation, which is the difference between observed and
true scores averaged over all ratees and dimensions;

2) differential elevation, the accuracy in
differentiating among ratees' scores averaged
across dimensions (main effect of ratees);

3) stereotype accuracy, reflecting difference among
dimensions averaged over ratees (main effect of
dimensions); and

4) differential accuracy, or ratee differences in
patterns of performance (ratee times dimension
interaction).
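
The four components listed above can be sketched numerically. The code below is a minimal illustration, assuming one common squared-difference formulation in which the rating-minus-target differences are split into a grand mean, ratee effects, dimension effects, and their interaction. The function name, the dictionary keys, and the toy data are hypothetical.

```python
from statistics import mean


def cronbach_components(ratings, targets):
    """Split mean squared rating error into four squared components.

    ratings, targets: lists of rows (ratees), each a list of dimension scores.
    """
    n, k = len(ratings), len(ratings[0])
    # Difference between observed rating and target (true) score per cell.
    diff = [[ratings[i][j] - targets[i][j] for j in range(k)] for i in range(n)]
    grand = mean([d for row in diff for d in row])
    ratee_means = [mean(row) for row in diff]
    dim_means = [mean([diff[i][j] for i in range(n)]) for j in range(k)]
    return {
        # Overall level error, across all ratees and dimensions.
        "elevation": grand ** 2,
        # Main effect of ratees.
        "differential_elevation": mean([(r - grand) ** 2 for r in ratee_means]),
        # Main effect of dimensions.
        "stereotype_accuracy": mean([(c - grand) ** 2 for c in dim_means]),
        # Ratee-by-dimension interaction.
        "differential_accuracy": mean([
            (diff[i][j] - ratee_means[i] - dim_means[j] + grand) ** 2
            for i in range(n) for j in range(k)
        ]),
    }


# Hypothetical data: 2 ratees rated on 3 dimensions.
ratings = [[5.0, 4.0, 3.0], [2.0, 3.0, 4.0]]
targets = [[4.0, 4.0, 2.0], [3.0, 3.0, 3.0]]
components = cronbach_components(ratings, targets)
print(components)
```

Under this formulation the four squared components sum to the mean squared difference between ratings and target scores, which is why each one can be read as a separate source of inaccuracy. Smaller values indicate greater accuracy.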

Borman  (1977)  used  an  index  that  he  termed  differential 
accuracy,  but  it  really  shows  the  validity  of  the  ratings 
since  it  uses  only  the  correlational  component  of  the 
psychometric  definition  of  accuracy. 

Murphy and Cleveland (1991) believe differential
elevation is the best measure of accuracy. Some, however,
do not even advocate accurate evaluations. Longenecker,
Sims, and Gioia (1987) suggest that using the performance
appraisal to motivate is what is important, not getting
accurate assessments of job performance. What this shows is
that the performance appraisal has multiple uses.

The final method to assess usefulness of performance
appraisals is through psychometric criteria. Though
internal consistency reliability is used often (Murphy &
Cleveland, 1991), it is not appropriate because
dimensions should be giving distinct information about job
performance. Interrater reliability is more appropriate in
this instance. It suggests that raters are focusing on
similar samples of job behavior (Borman, 1991), but it can
still represent shared biases of a few raters.

Construct  validity  as  a psychometric  property  of 
supervisory  ratings  is  useful  as  an  indirect  measure  of 
accuracy (Murphy & Cleveland, 1991). If the ratings do not
suffer  from  distributional  errors,  illusory  halo,  first 
impression,  or  other  errors,  the  ratings  should  converge  on 
the  desired  construct  of  job  performance.  Limited  research 
is  available  concerning  construct  validity  of  ratings. 
However,  if  raters  consistently  agree,  this  is  indirect 
evidence  of  construct  validity. 


For example, Harris and Schaubroeck ... and supervisor
ratings. They also found that self and supervisor ratings
converged (r = .35) as well as self and peer evaluations (r
= .36, all corrected for attenuation). This is evidence of
convergent validity.
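
The correlations above are reported as corrected for attenuation. A minimal sketch of the classical correction follows, in which an observed correlation is divided by the square root of the product of the two measures' reliabilities. The function name and the numeric values are illustrative assumptions and are not the figures used in the cited study.

```python
import math


def correct_for_attenuation(r_xy, r_xx, r_yy):
    """Classical disattenuation: r_xy / sqrt(r_xx * r_yy).

    r_xy: observed correlation between the two sets of ratings.
    r_xx, r_yy: reliability estimates of each set (each in (0, 1]).
    """
    if not (0 < r_xx <= 1 and 0 < r_yy <= 1):
        raise ValueError("reliabilities must lie in (0, 1]")
    return r_xy / math.sqrt(r_xx * r_yy)


# Hypothetical values: observed r = .30, reliabilities .81 and .64.
print(round(correct_for_attenuation(0.30, 0.81, 0.64), 3))
```

Because reliabilities cannot exceed 1, the corrected coefficient is never smaller in magnitude than the observed one.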


There  is  further  evidence  of  convergent  validity 
between  subjective  and  objective  measures  of  performance. 
To  the  extent  that  ratings  are  correlated  with  objective 
measures  of  job  performance,  ratings  are  capturing  true 


performance  (Heneman,  1986) . Finally,  validities  for 
selection  tests  predicting  supervisory  ratings  are  as  high 
as  validities  for  selection  tests  predicting  objective 
measures  of  job  performance  (Schmitt  et  al.,  1984). 

Since performance appraisals are accepted as the best
available method of evaluating individual performance,
accuracy is the main goal for research. If selection of
raters is made carefully and proper training is given to
raters, accuracy is enhanced (Borman, 1991). Still, the
problem of making the rater attend to the task to do it
properly remains. One way to motivate raters to be
accurate is to hold them accountable for their ratings.
This leads raters to make accurate ratings, partly by
reducing their reliance on irrelevant factors (Mero &
Motowidlo, 1995). Accountability forces raters to make the
task a main responsibility and thus reduces reliance on
other factors that do not help make good ratings (Tetlock,
1985). Raters engage in a more active process of gathering
information and of considering implications of that
information (Mero & Motowidlo, 1995).

Raters must be willing to give accurate ratings or any
discussion of their ability to do so is irrelevant. Rating
quality is related to the conditions of rating context, and
possibly not related to idiosyncratic rater effects (Mero &
Motowidlo, 1995). From a research perspective, to ensure
that performance raters give ratings as accurately as
possible, it seems imperative that job performance ratings
made exclusively for research purposes are gathered. If
supervisors know that their ratings are not going to be used
as part of the subordinate's personnel files, they should be

of  behaviorally  anchored  rating  scales  assist  in  attaining 


Types of Nonverbal Cues


Two types of nonverbal variables exist: behavioral and
non-behavioral (De Meuse, 1987). In his review of the
effects of nonverbal cues on performance appraisal, De Meuse
(1987) states that most of the research has focused on
demographic cues (non-behavioral) and has ignored the
potential interactions with physical appearance variables
(non-behavioral) and behavioral variables such as
paralinguistic cues. Nonetheless, the interactive nature between


research  done  in  this  vein  (De  Meuse,  1987). 

More  research  has  examined  the  effects  of  nonverbal 
cues  on  interview  ratings.  The  research  in  this  vein  has 
broken behavioral and nonbehavioral cues into dynamic,
static, and paralinguistic cues. Dynamic cues are easily
changed, such as eye contact, body orientation, smiling,
gesturing  and  head  movement.  Static  cues  are  demographic 
variables  and  physical  attractiveness.  Paralinguistic  cues 
refer  to  vocal  characteristics  such  as  speech  rate,  volume, 
tone  and  pausing.  As  in  the  performance  appraisal 
literature,  however,  most  of  the  research  has  focused  on 

Visual  Cues 

Visual cues can have powerful effects on the
favorability of interview ratings. For instance, a
relationship with favorable ratings has been shown for
physical attractiveness (Dipboye, Arvey, & Terpstra,
1977; Raza & Carpenter, 1987) and for the clothing worn in
an interview (Forsythe, Drake, & Cox, 1985). Hand gestures,
smiling, and the manner in which a job applicant orients his
or her body with regard to the interviewer all have had
effects on interview ratings (Gifford, Ng, & Wilkinson, 1985;
Rasmussen, 1984; Forbes & Jackson, 1980; Hollandsworth,
Kazelskis, Stevens, & Dressel, 1979; Imada & Hakel, 1977).
Finally, high levels of eye contact maintain dominance in
face-to-face interactions (Webbink, 1986) and have been
shown to affect judgments of applicant qualifications
(Parsons & Liden, 1984). Thus, five visual nonverbal cues
stand out as having been examined for their effects in
organizational settings: physical attractiveness, smiling,
gaze, hand gesturing, and body orientation.

High  levels  of  each  of  these  individual  variables  are 
hypothesized  to  positively  relate  to  interview  and  job 
performance  ratings.  Since  each  variable  by  itself  may  only 
have  a small  relationship  with  the  dependent  variables,  an 
additive  combination  of  all  five  variables  more  accurately 
represents  the  combined  effects  of  visual  cues.  Following 
previous  findings,  this  composite  variable  is  defined  as 

Visual Cue Index = Gaze + Smiling + Gesturing +
                   Physical Attractiveness + Body Lean

An  index  variable  such  as  this  can  then  be  examined  for 
overall  effects  of  visual  cues  on  interview  and  job 
performance  ratings.  Therefore,  the  first  set  of  hypotheses 
to  test  the  model  in  Figure  2-1  is: 

Hypothesis 1a: Visual cues are positively related to
interview judgments.

Hypothesis 1b: Visual cues are positively related to
job performance ratings.
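
The additive visual cue index defined above might be computed as follows. This is a sketch only. Standardizing each cue to z-scores before summing is an assumption made here so that cues rated on different scales contribute equally; the text specifies only an additive combination. The cue keys and the data are hypothetical.

```python
from statistics import mean, pstdev

# Hypothetical keys for the five visual cues named in the index.
CUES = ("gaze", "smiling", "gesturing", "attractiveness", "body_lean")


def zscores(values):
    """Standardize a list of scores (assumes the scores are not all equal)."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def visual_cue_index(managers):
    """Unit-weighted sum of standardized cue ratings, one value per manager.

    managers: list of dicts mapping each cue name in CUES to a rating.
    """
    columns = [zscores([m[cue] for m in managers]) for cue in CUES]
    return [sum(col[i] for col in columns) for i in range(len(managers))]


# Hypothetical ratings for three managers on a 5-point scale.
sample = [
    {"gaze": 2.0, "smiling": 1.0, "gesturing": 3.0, "attractiveness": 2.0, "body_lean": 2.0},
    {"gaze": 3.0, "smiling": 3.0, "gesturing": 3.5, "attractiveness": 3.0, "body_lean": 4.0},
    {"gaze": 5.0, "smiling": 4.0, "gesturing": 4.0, "attractiveness": 4.0, "body_lean": 5.0},
]
print(visual_cue_index(sample))
```

A unit-weighted composite of this kind treats each cue as equally important, which matches the additive form of the index but would need revisiting if some cues proved far more predictive than others.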


Little  work  has  been  undertaken  to  examine  the  impact 
of  audio  cues.  A typical  study  isolates  some  paralinguistic 

subjective ratings to measure them. For instance, one study
found that job candidates who pause before answering, focus
on key interviewer words, and organize their answer before
responding decrease speech disturbances and improve their
speech fluency (Hollandsworth et al., 1978). Speech fluency
was related to interview ratings.

However,  this  same  study  measured  loudness  of  voice, 
but  found  no  significant  relation  to  the  employment 
decision.  One  could  speculate  on  these  two  results.  First, 
reliability  as  a psychometric  property  was  not  examined. 

Thus,  loudness  of  voice,  for  example,  should  not  be 
dismissed  as  unrelated  to  employment  decisions  unless 
reliability  of  ratings  is  examined,  or  another  method  of 
measurement  is  found.  An  alternative  way  to  measure  vocal 
characteristics  is  objectively  with  the  use  of  computers. 
Before  discussing  objective  measurement  of  vocal 
characteristics,  a closer  look  at  audio  nonverbal  cues,  or 
voice  characteristics  is  needed. 

Evidence exists that voice characteristics are
indicators of one's personality or even the emotional state
of speakers (Nighswonger & Martin, 1981). Nighswonger and
Martin (1981) noted that voice pitch analysis was useful in
marketing research: changes in emotion can be detected
through pitch changes in the voice. This ...
technologies that lie detectors use. However, a speaker has
relatively little control over permanent voice quality, nor
can voice quality be completely suppressed or disguised
(Greene & Mathieson, 1989), though accomplished speakers can
use different patterns of audio cues when deemed appropriate
(Pearce & Brommel, 1972). Thus, voice characteristics are
stable attributes of a person and they may indicate
individual traits.

Some  work  has  been  done  toward  tying  a person's  speech 
behavior  to  organizational  consequences.  Patterns  of 
individual  sound  production  were  predictive  of  success  in 
two different situations (Rousey, 1974; Rousey & Mitchell,
1974). Based on these findings, the assumption was made
that successful individuals in management should have normal
communication ability and unsuccessful individuals would
have disturbed patterns of communication (Rousey, Morrison,
& Deacon, 1995). This assumption was supported in their
study  that  used  data  from  executives  at  three  levels  of 
success.  They  concluded  successful  managers  had  different 
psychological  characteristics  than  unsuccessful  managers  and 
this  could  be  inferred  from  speech  behaviors  (Rousey, 
Morrison,  & Deacon,  1995) . Thus,  there  may  be  a practical 
use  for  knowing  individual  differences  on  voice 
characteristics.  If  specific  characteristics  of  the  voice 
can  be  reliably  measured,  their  effects  on  job  performance 


ratings  and  interviewer  ratings  could  be  examined. 

One  voice  characteristic  that  has  been  heavily  studied 
is  fundamental  frequency  (FF),  or  the  pitch  of  a voice. 

Some  researchers  have  suggested  that  future  academic  success 
can  be  predicted  with  voice  pitch  and  range  (Greene  & 
Mathieson, 1989). More successful students have been found
to  use  higher  pitch,  less  volume,  and  more  appropriate 
intonation  than  less  successful  peers  (Edwards,  1982) . 

Sharf  and  Lehman  (1984)  found  more  successful  phone 
interviewers  had  higher  FF  and  greater  range  in  FF  than  less 
successful  interviewers. 

Scherer,  London,  and  Wolf  (1973)  found  higher  pitch 
levels  increased  impressions  of  competence,  dominance,  and 
assertiveness.  Aronovitch  (1976)  found  this  to  be  true  for 
women,  but  not  for  men.  Scherer  (1978)  explains  this  as 
follows. As pitch rises to a threshold of about 140 Hz in


and  as  the  pitch  approaches  female  levels,  impressions 
quickly  become  unfavorable. 

In selection research, it would be interesting to know
if "high" and "low" voice are correlated with job
performance. Do women, or high voice individuals, receive
higher performance judgments? Or do high voice individuals
receive lower ratings from those judging their performance
from videotape? One might argue that raters of performance
who have limited access to the ratee's behaviors may give
higher ratings to a low voiced female, and to a low voiced
male. This reflects common stereotypes of effective
leaders: low voiced individuals are considered
better managers through gaining compliance of followers via
a dominant sounding voice. Social information processing
theory suggests that people make summary judgments of
suitability for leadership based on observations of
prototypical attributes (Hollander & Offermann, 1993).

Range  of  Pitch 

Another voice characteristic that may reflect positive
managerial traits is the range of frequency an individual
uses when speaking. A general assumption is made that a
speaker is dynamic and an extravert if the pitch is varied
widely (e.g., Greene & Mathieson, 1989; Scherer, 1979). For
instance, Scherer (1979) found that greater pitch
variability was linked to perceived speaker dynamism,
potency, and benevolence (sociability). Those ...
considered boring. Brown, Strong, and Rencher (1973)
manipulated the variance of FF in a lab study and found that
increased variance of FF made the speaker sound more
benevolent, and to a lesser extent, more competent. Less
variation of FF had the opposite effect in this controlled
experiment. Therefore, speakers using a larger range of
pitch, those who vary their voices considerably, might be
considered more charismatic and have their performance rated
more favorably.

Rate of speech may be positively related to ratings of
performance, but only to a certain point. Brown (1980)
shows a curvilinear relationship between rate and
sociability. Stereotypes for rate of speech consider rates
socially attractive up to 197-324 syllables per minute, but
after about 376 s.p.m., diminish in attractiveness (Brown,
1980).

Most studies have found faster speech rates increase
competence and extroversion perceptions (e.g., Mehrabian &
Williams, 1969; Pearce & Brommel, 1972; Buller et al.,
1992). Brown, Strong, and Rencher (1973) found that
slowing the speech rate makes the voice sound less
competent, but increased rate makes the voice sound less
benevolent. Those who speak more slowly may be perceived as
less intelligent or "disadvantaged" (Williams, 1970).
Finally, slow speech in association with low vocal pitch is
recognized as an indication of anxiety or despair
(Gudykunst, 1986).


Voice  Breaks 

A related  variable  is  voice  breaks,  or  pauses. 

Effective  use  of  pauses  can  call  attention  to  particular 
ideas,  and  unfilled  pauses  used  in  strategic  locations  can 
enhance speech (Feldman & Rimé, 1991). If the pauses are
used  such  that  they  contribute  to  the  speaker  becoming  more 
fluent,  then  pausing  is  good.  Hollandsworth,  Kazelskis, 
Stevens,  and  Dressel  (1979)  found  fluency  of  speech  strongly 
contributes  to  the  employment  decision.  These  authors  found 
if  a candidate  pauses  before  answering  a question,  speech 
fluency  is  improved  and  favorable  interview  evaluations 


Up to a moderately high level, vocal intensity, or
loudness, is positively related to impressions of
extroversion, sociability, and dominance (Aronovitch, 1976;
Scherer, 1976). Conversational speech measured three feet
from the speaker has an average intensity of 60 decibels
(dB), quiet speech is approximately 35-40 dB, and shouting
is 75 dB (Fry, 1979). However, the variability of this
intensity provides quite different information about the
speaker. Amplitude perturbation is short term instability
of the intensity in a vocal signal (Greene & Mathieson,
1989). Measurement of this quality, also called shimmer,
quantifies the variability of the intensity (amplitude) of
the voice. This variability has been associated with the
perception ... undesirable (Baken, ...). A smooth speaker,
one who would evoke higher favorability judgments, would
have low levels of shimmer.

To  summarize,  certain  voice  characteristics  may 
correlate  with  different  performance  ratings  because  of  the 
"pleasantness"  of  the  voice  sound  or  the  way  that  the  voice 
falls  into  stereotypes  of  the  genders  and  beliefs  of  what 
constitutes  "good"  managers.  Possibly,  voice 
characteristics  affect  contextual  performance  by  getting 
others  to  perform  better.  For  instance,  a manager  with  a 
"pleasant"  sounding  voice  may  induce  followers  to  exhibit 
more  helping  behavior,  and  thus,  increase  the  effectiveness 
of the manager. Research has shown evidence of the presence
of an "attractive voice" (Zuckerman & Driver, 1989;
Zuckerman & Miyake, 1993; Zuckerman, Hodgins, & Miyake,
1990). Thus, a combination of audio cues described here may
encompass  this  attractive  voice.  A review  of  the  attractive 
voice  literature  may  help  to  see  its  organizational 
significance  as  an  indicator  of  audio  cues. 

Attractive  Voice 

Every voice is unique to the speaker (Greene &
Mathieson, 1989). Some voices are more "attractive"
than others. It is possible that attractive voices
are associated with more favorable impressions, much as in
the physical attractiveness stereotype. Studies have shown
that attractive people actually develop more favorable
self-images and manifest more confident behavior patterns
(Zuckerman & Driver, 1989). These authors propose a vocal
attractiveness stereotype that will represent the influence
of the auditory portion of appearance much like the physical
attractiveness stereotype that represents the influence of
the visual portion of appearance.

To  the  extent  that  impressions  vary  as  a function  of 
vocal  characteristics,  these  impressions  likely  will  vary  as 
a function  of  vocal  attractiveness  (Zuckerman  & Driver, 

1989) . If  there  exists  reliability  among  raters  of 
attractive  voice,  the  case  can  be  made  for  a vocal 
stereotype.  This  research  has  been  performed  and  shows  that 

with  interrater  reliability  estimates  averaging  .65 
(Zuckerman  & Miyake,  1993) . 

Furthermore,  Zuckerman  and  Driver  (1989)  found  that 

favorably  than  senders  with  less  attractive  voices.  These 
authors point out that this finding is robust, since the
effects have been replicated across two studies. However,
they were unsuccessful in determining the dimensions of a
stereotype. Vocal attractiveness had greater effects
on attributions of dominance in one study, but larger
effects on likability in their second study. In an effort
to elaborate upon these findings, Zuckerman, Hodgins, and
Miyake (1990) used the NEO subscales of personality (Costa &
McCrae, 1985). Results from this study point to an
attractive voice that judges find lacks tension, but has the
presence of confidence without giving an indication of
whether the person is good-natured (Zuckerman, Hodgins, &
Miyake, 1990).

Familiarity  weakens  effects  of  the  vocal  attractiveness 
stereotype  when  raters  are  used  to  describe  targets 
(Zuckerman, Hodgins, & Miyake, 1990). Perhaps objective
measures  of  voice  characteristics  can  better  explain  what 
determines  a vocal  attractiveness  stereotype.  Zuckerman  and 
Miyake  (1993)  looked  at  both  subjective  and  objective 
measures  of  voice  quality  on  general  favorableness  of 
impressions.  They  found  that  both  types  of  measures 
predicted  vocal  attractiveness.  However,  if  an  objective 
way  to  measure  vocal  cues  were  available,  the  problems 
inherent  in  subjective  ratings  of  attractive  voice  could  be 
avoided. 

From  this  review,  five  audio  variables  have  been  shown 
to  represent  vocal  attractiveness  adequately.  These  five 
variables  could  be  combined  to  form  an  overall  vocal 
attractiveness  measure,  or  the  audio  cues  in  Figure  2-1. 


The  literature  review  presented  supports  the  following 
combination  of  these  variables  that  form  an  index  where  high 
values  are  hypothesized  to  lead  to  higher  ratings  of 
performance : 


Audio Cue Index = Speech Rate + Voice Breaks + Range of
                  Pitch - Amplitude Variability - Pitch

With this index representing audio cues, the next set of
hypotheses is:

Hypothesis  2a:  Audio  cues  are  positively  related  to 
interview  judgments. 


Hypothesis 2b: Audio cues are positively related to
ratings of job performance.
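
The signed audio cue index might be computed along the following lines. As a sketch, it assumes each vocal measure is first standardized across speakers, since the raw measures are on different units; the text specifies only the signs of the five terms. The dictionary keys and the two example speakers are hypothetical.

```python
from statistics import mean, pstdev

# Sign of each term in the audio cue index; the keys are illustrative names.
SIGNS = {
    "speech_rate": +1,
    "voice_breaks": +1,
    "pitch_range": +1,
    "amplitude_variability": -1,
    "pitch": -1,
}


def audio_cue_index(speakers):
    """Signed sum of standardized vocal measures, one value per speaker.

    speakers: list of dicts mapping each key in SIGNS to a measurement.
    Assumes every measure varies across speakers (nonzero spread).
    """
    index = [0.0] * len(speakers)
    for cue, sign in SIGNS.items():
        values = [s[cue] for s in speakers]
        m, sd = mean(values), pstdev(values)
        for i, v in enumerate(values):
            index[i] += sign * (v - m) / sd
    return index


# Hypothetical measurements for two speakers.
speakers = [
    {"speech_rate": 220.0, "voice_breaks": 12.0, "pitch_range": 90.0,
     "amplitude_variability": 0.3, "pitch": 110.0},
    {"speech_rate": 180.0, "voice_breaks": 8.0, "pitch_range": 50.0,
     "amplitude_variability": 0.6, "pitch": 150.0},
]
print(audio_cue_index(speakers))
```

In this toy example the first speaker scores higher on every positively signed cue and lower on both negatively signed cues, and so receives the higher index.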

Mediating  Variables 
to Nonverbal Cues

If a speaker has an attractive voice combined with
physical attractiveness, often favorable ratings can result.
This  combination  also  could  be  explained  as  social 
competence,  since  nonverbal  behavior  has  been  argued  as  a 
manifestation of social competence (Feldman, Philippot, &
Custrini, 1991). Though social competence is difficult to
define,  it  broadly  relates  to  evaluative  judgments  of  the 
adequacy  of  a person's  social  performance.  Interview 
favorability  could  be  associated  with  the  concept  of  social 
competence. If an interviewee displays positive social
performance in the interview, more favorable ratings
should result.

Nonverbal  cues  may  elicit  personal  reactions  toward  the 
speaker. In De Meuse's (1987) model of the impact of
nonverbal  cues  on  the  performance  appraisal  process,  two 
dimensions  of  interpersonal  behavior,  affective  evaluation 
and  perceived  power,  mediate  the  relationship  between 
nonverbal  cues  and  performance  ratings.  Affective 
evaluation  represents  the  degree  of  liking  the  rater  has  for 
the  ratee.  Perceived  power  reflects  the  amount  of  informal 
dominance  the  ratee  has  in  the  ratee/rater  relationship 
(Schlenker,  1980) . This  model  predicts  that  the  impact  of 
nonverbal  cues  on  ratings  of  performance  is  mediated  by  the 
degree  to  which  the  ratee  is  judged  positively  on  affective 
qualities.  General  support  has  been  found  for  the  model 
(e.g., Edinger & Patterson, 1983; Mehrabian, 1972;

Schlenker,  1980) . However,  the  mediating  variable 
"perceived  power  of  the  ratee"  is  not  relevant  concerning 
interview  ratings,  since  the  interviewee  has  little  power  in 
this  situation.  Thus,  that  link  will  not  be  examined  here. 
Nevertheless,  the  other  mediating  variable  in  this  model, 
"affective  evaluation  of  the  ratee,"  is  relevant  in  both  job 
performance  appraisal  and  interview  performance  appraisal. 
This  affective  evaluation  may  encompass  several  different 
personal  reaction  variables. 


Personal  Reactions 

The  literature  has  examined  the  effects  of  personal 
reaction  variables.  For  instance,  about  a third  of  the 
variance  of  amount  of  liking  felt  by  strangers  on  first 
meeting  is  accounted  for  by  nonverbal  cues  (Maxwell,  Cook,  & 
Burr, 1985). These positive first impressions are gained by
using expressive behaviors such as gestural fluency,
speaking fluency, and body emphasis (e.g., head movements)
(Riggio & Friedman, 1986). Generally, positive evaluations
are attributed to those performing behaviors that are more
active (Shrout & Fiske, 1981). Those who use more active
expressive behaviors (i.e., more gestures, more vocal range,
etc.) increase the attention paid to them by the observer,
which in turn may cause affirmative evaluations to become
more positive (Taylor & Fiske, 1978). Riggio and Friedman
(1986)  found  strong  relations  between  nonverbal  cues  and 
likability,  especially  speaking  fluency  for  men  and  facial 
expressions  for  women. 

Nonverbal variables associated with an individual's
level of persuasiveness have been examined also. For
example, physically attractive people have been found more
persuasive than unattractive people (Chaiken, 1979). Facial
appearance  in  particular  has  been  shown  to  interact  with 
speaker  credibility  in  determining  attitude  change  as  a 
result  of  persuasive  speech  (Brownlow,  1992) . Burgoon 


(1990)  found  a positive  relationship  between  gaze  and 
persuasiveness . 

Audio  cues  that  connote  pleasantness  foster  favorable 
credibility  judgments  and  have  been  associated  with 
persuasiveness (Burgoon, Birk, & Pfau, 1990). These
researchers  interpret  their  findings  to  mean  that 
credibility  and  persuasiveness  are  simultaneous  outcomes 
directly  influenced  by  nonverbal  variables.  Other  audio 
cues  have  been  positively  correlated  with  persuasiveness 
such  as  range  of  fundamental  frequency  (Burgoon,  1990), 
pitch (Sharf & Lehman, 1989), and speech rate (Burgoon,

Competence  as  a personality  trait  has  been  shown  to 
correlate  with  nonverbal  cues.  Scherer,  London,  and  Wolf 
(1973)  found  some  vocal  characteristics  increased 
impressions  of  competence,  dominance,  and  assertiveness. 
Range  of  FF,  which  indicates  expressive  voice,  increased 
perceptions  of  competence  in  speakers  (Pearce  & Conklin, 
1971;  Ray,  1986;  Burgoon,  1990).  Visual  cues  such  as 
physical attractiveness (Hickling et al., 1979), smile
(Chaiken,  1979),  and  gaze  (Tipton  & Rymer,  1978)  all  have 
been  positively  related  with  competence. 

Individuals who were physically attractive (Corrigan,
1976), who smiled a lot (La France & Hecht, 1995), and who
gazed directly at the listener (Burgoon et al., 1984) were
seen as more trustworthy than those who exhibited low levels
of these variables. Similarly, listeners were more apt to
comply with speakers who smiled a lot (O'Quin & Aronoff,
1981) and who held the gaze of the listener (Willis & Hamm,
1980). Speech rate has been shown to increase levels of
compliance also (Buller & Aune, 1988).

Finally,  the  degree  to  which  one  will  help  another  has 
been  explored  with  regard  to  nonverbal  cues.  Valentine 
(1980)  found  a positive  relationship  between  gaze  and 
helping  behavior.  In  another  study,  expressive  voice  (range 
of  FF)  was  related  to  increased  helping  (Goldman  & Fordyce, 
1983) . Thus,  the  literature  has  shown  seven  personal 
reaction  variables  to  be  related  to  nonverbal  cues. 

Summary: Personal Reactions to Nonverbal Cues

The  relationship  between  nonverbal  cues  and  personal 
reaction  variables  has  been  established  from  the  review 
above.  A variable  that  carries  the  combined  effects  of 
personal reactions will be used in this study. This index
represents the degree of trust in the speaker (trust),
degree of liking felt for the speaker (liking), degree to
which the listener feels persuaded by the speaker
(persuade), degree of competence attributed to the speaker
(competent), degree of dominance the listener feels the
speaker has (dominant), degree to which one would comply with
the speaker (comply), and degree to which one would help the
speaker (help). The variable is combined as follows:

Personal Reactions = Trust + Liking + Persuade +
                     Competent + Dominant + Comply + Help

If  the  effects  of  nonverbal  cues  on  these  personal  reactions 
to  a speaker  can  be  shown  in  the  context  of  the 
manager/subordinate  relationship,  important  implications  for 
contextual  job  performance  and  interview  favorability  can  be 
quantified.  Thus, 

Hypothesis  3a:  Visual  cues  are  positively  related  to 
personal  reactions. 

Hypothesis  3b:  Audio  cues  are  positively  related  to 
personal  reactions. 

Cognitive  Prototypes:  Personality  Traits 

Nonverbal  cues  also  may  reflect  underlying  personality 
traits  related  to  patterns  of  behavior  that  lead  to 
favorable  judgments.  Behaviors  that  comprise  dimensions  on 
which  judgments  are  made  overlap  with  the  Big  Five 
personality  structure.  This  follows  the  logic  of  a trait 
theory  of  personality,  one  that  views  individual  differences 
in  behavior  as  organized  by  stable,  consistent  dispositions 
in  individuals.  Behavior  is  consistent  across  time  and 
situations  and  should  be  predictable  from  the  assessed 
traits. Since relationships between extroversion,
agreeableness, and conscientiousness and organizationally
relevant outcomes have been established (e.g., Barrick &
Mount, 1991), it is desirable for this study to find
specific Big 5 personality factor concomitants of nonverbal
cues. Then, we can examine the hypothesized link between
nonverbal cues and ratings of performance by looking at
their relationship with attributions of personality
variables.

It is not that difficult to infer accurately the
personality traits of acquaintances, or even of people to
whose behaviors we have had minimal exposure (e.g., Bourne,

it  much  more  difficult  to  infer  personality  traits  on 
individuals  with  whom  one  has  zero  acquaintance.  However, 
research has shown that even at zero acquaintance perceivers
can make consistent summary judgments about strangers (e.g.,
Albright et al., 1988; Borkenau & Liebler, 1992; DiPilato,
1989; Norman & Goldberg, 1966; Watson, 1989). However,

Kenny  et  al.  (1992)  note  that  only  extroversion  and 
conscientiousness  are  consistently  attributed  to  others  from 
zero  acquaintance  studies.  Agreeableness  was  the  most 
difficult  of  the  Big  5 factors  to  attribute  to  strangers. 
Thus,  though  agreeableness  is  associated  with  some 
organizationally  relevant  outcomes,  it  may  not  be  as  useful 
in  zero  acquaintance  situations,  the  case  in  this  study. 


On the other hand, research has shown that people can
accurately attribute extroversion and conscientiousness to
others. To illustrate, a study by Borkenau and Liebler
(1992)  showed  some  agreement  on  nonverbal  cue  usage  across 
personality  factors.  One  hundred  subjects  were  videotaped 
while  entering  and  walking  around  a room,  sitting,  then 
reading  a standard  text.  Twenty-four  raters  gave 
personality  ratings  of  the  experimental  subjects,  based  on 
the  Big  Five  dimensions.  These  ratings  formed  an 
attribution  measure  of  the  traits  made  by  others  to  compare 
with  self  ratings.  The  four  conditions  for  rating  were 
silent  film,  still  picture,  audiotape,  or  sound-film. 

Significant results were found for the correlation
between extroversion and soft voice for self ratings
(r = -.26) and even stronger results with the percept
ratings (r = -.42). Powerful voice also was related to
extroversion with the same pattern between self and percept
ratings (r = .2, r = .46). Conscientiousness was positively
related to high voice using self ratings (r = .22) and with
perceivers' ratings (r = .42). Though additional significant
correlations with other voice characteristics were found,
they did not match with the self ratings in this study.

Further evidence of this consistent relationship with
nonverbal cues is available. Perceptions of others'
conscientiousness have been positively correlated with gaze
and speaking fluency (Funder & Sneed, 1993), speech rate
(Aronovitch, 1976), and hand movement (Borkenau & Liebler,
1992). Agreeableness has been strongly related to all
visual cues (Borkenau & Liebler, 1992; Funder & Sneed,
1992), but little evidence has supported the relationship
between audio cues and agreeableness. This may reflect the
difficulty of inferring agreeableness at zero acquaintance
with the subject (Albright, Kenny, & Malloy, 1988).

Besides  the  relationships  described  above,  physical 
attractiveness  (Kenny  et  al.,  1992)  and  many  audio  cues 
(Aronovitch,  1976)  have  been  positively  correlated  with 
extroversion.  Riggio  and  Friedman  (1986)  found  a strong 
relationship  between  speaking  fluency,  gestural  fluency  and 
extroversion.  Though  extroversion  has  been  the  easiest  of 
the  Big  Five  personality  factors  to  infer  upon  others,  given 
certain  nonverbal  cues  only,  conscientiousness  and 
agreeableness  to  a lesser  degree  should  be  related  to 
nonverbal  cues  since  they  are  related  to  interview  ratings 
and  job  performance  ratings.  Therefore, 

Hypothesis 4a: Visual cues are positively related to
visual personality attributions of extroversion.

Hypothesis 4b: Visual cues are positively related to
visual personality attributions of conscientiousness.

Hypothesis 4c: Visual cues are positively related to
visual personality attributions of agreeableness.

Hypothesis 4d: Audio cues are positively related to
audio personality attributions of extroversion.

Hypothesis 4e: Audio cues are positively related to
audio personality attributions of conscientiousness.

Hypothesis 4f: Audio cues are positively related to
audio personality attributions of agreeableness.


Model Testing: Part One


To this point, this work has hypothesized about the
importance of audio and physical characteristics in
determining favorable interview ratings and ratings of job
performance by empirically examining the links shown in
Figure 2-1. Following Baron and Kenny (1986), if
significant relationships are uncovered in the first four
sets of hypotheses, the final test of the mediating model
can be conducted, first for visual cues and interview
judgments.

Hypothesis 5a: The effect of visual cues on interview
judgments is mediated by visual personal reactions.

Hypothesis 5b: The effect of visual cues on interview
judgments is mediated by visual personality attributions of
extroversion.

Hypothesis 5c: The effect of visual cues on interview
judgments is mediated by visual personality attributions of
conscientiousness.

Hypothesis 5d: The effect of visual cues on interview
judgments is mediated by visual personality attributions of
agreeableness.

Similarly,  the  model  will  be  tested  for  mediating 
effects  of  the  relationship  between  audio  cues  and  interview 
judgments. 

Hypothesis 6a: The effect of audio cues on interview
judgments is mediated by audio personal reactions.

Hypothesis 6b: The effect of audio cues on interview
judgments is mediated by audio personality attributions of
extroversion.

Hypothesis 6c: The effect of audio cues on interview
judgments is mediated by audio personality attributions of
conscientiousness.

Hypothesis 6d: The effect of audio cues on interview
judgments is mediated by audio personality attributions of
agreeableness.

The  preceding  tests  will  have  examined  all  links  in 
Figure  2-1  with  interview  judgments  as  the  dependent 
variable.  The  hypotheses  in  set  H5  and  H6  also  will  be 
tested  with  job  performance  ratings  as  the  dependent 
variable.  To  link  visual  and  audio  cues  in  the  interview  to 
job  performance  ratings,  the  assumption  is  made  that  the 
same  cues  are  expressed  on  the  job  and  perceived  by  job 
performance  raters.  For  visual  cues, 


Hypothesis 7a: The effect of visual cues on job
performance ratings is mediated by visual personal
reactions.

Hypothesis  7b:  The  effect  of  visual  cues  on  job 
performance  ratings  is  mediated  by  visual  personality 
attributions  of  extroversion. 

Hypothesis  7c:  The  effect  of  visual  cues  on  job 
performance  ratings  is  mediated  by  visual  personality 
attributions  of  conscientiousness. 

Hypothesis  7d:  The  effect  of  visual  cues  on  job 
performance  ratings  is  mediated  by  visual  personality 
attributions  of  agreeableness. 

Finally,  to  complete  analysis  from  Part  One,  the 
following  hypotheses  predict  mediating  effects  with  audio 
cues  and  job  performance  ratings: 

Hypothesis 8a: The effect of audio cues on job
performance ratings is mediated by audio personal reactions.

Hypothesis 8b: The effect of audio cues on job
performance ratings is mediated by audio personality
attributions of extroversion.

Hypothesis 8c: The effect of audio cues on job
performance ratings is mediated by audio personality
attributions of conscientiousness.

Hypothesis 8d: The effect of audio cues on job
performance ratings is mediated by audio personality
attributions of agreeableness.


If  the  above  hypotheses  hold,  specific,  partial 
explanations  for  the  relationships  between  nonverbal  cues 
and  both  interview  favorability  and  job  performance  ratings 
will  be  evident.  In  the  first  section  of  Part  Two,  the 


validity  coefficient  between  the  interview  ratings  and  the 
job  performance  ratings  partialling  out  the  influence  of 
nonverbal  cues  is  examined.  This  information  would  explore 
causes  of  validity,  not  each  type  of  rating  individually. 


Hypothesis 9a: Visual cues are a partial cause of the
validity coefficient for the interview.

Hypothesis 9b: Audio cues are a partial cause of the
validity coefficient for the interview.

To expand what is learned in the first section, the
relationships of cue usage between interview raters and job
performance raters are examined in the second section of
Part Two. If sufficient variance in the validity
coefficient is accounted for by nonverbal cues, it would be
interesting to know if interview raters and job performance
raters use the cues in the same way. Whether they use
personal reactions and personality attributions the same
when making their ratings is also important. Four questions
are posed to query these ideas.

Research Question 1: Do interview raters and job
performance raters use visual nonverbal cues in the same
way?

Research Question 2: Do interview raters and job
performance raters use audio nonverbal cues in the same way?

Research Question 3: Do interview raters and job
performance raters use personal reactions inferred from
nonverbal cues in the same way?

Research Question 4: Do interview raters and job
performance raters use personality traits inferred from
nonverbal cues in the same way?


CHAPTER  3 
METHODS 

Participants 

First-level  managers  were  interviewed  as  part  of  a 
larger  project  to  validate  a structured  interview  for  a 
national  publishing  firm.  The  interview  was  developed  with 
nine  questions  to  assess  seven  dimensions  of  management 
effectiveness:  leadership,  concern  for  others,  diversity, 
customer  service,  problem  solving,  professional  integrity, 
and  oral  communication.  The  sample  of  110  managers  includes 
56  females  and  54  males,  all  with  less  than  three  years  of 
management experience. There are 81 Whites (74%), 17 Blacks
(15%), nine Hispanics (8%), and three Asians (3%). Their
average age is 37.3 years.

Immediate  supervisors  provided  supervisory  ratings  on 
the  same  dimensions  used  in  the  development  of  the  interview 

Performance  ratings  were  made  exclusively  for  the  validation 


Interview Ratings

While  validating  the  interview,  videotapes  of  the 
actual  interviews  were  sent  to  Human  Resource  professionals 
in  the  firm.  They  were  instructed  to  watch  the  interviews 
using  an  interview  packet  that  contained  the  questions,  with 
space  to  take  notes.  The  same  scales  used  to  assess  job 
performance  were  used  to  make  interview  ratings.  Included 
in  the  packet  were  instructions  not  to  rate  anyone  whom  they 

Preparation  of  Experimental  Materials 
Job  performance  ratings  and  interview  ratings  were 
gathered  in  the  field,  but  the  remaining  variables  had  to  be 
collected  in  the  laboratory.  For  this  purpose,  the  sample 
of  videotaped  interviews  was  broken  into  fifths  based  on  job 
performance  ratings,  from  high  to  low.  Twenty-two  sets  of 
interviews  were  created  with  five  of  the  110  interviews  in 
each  set.  Each  set  had  one  interview  taken  from  each  fifth 
level of performance ratings. Two minutes randomly chosen
from the beginning, middle, and end of each interview
were copied onto a separate videotape. After each interview,
a one-minute blank portion was left to separate the
interviews. Thus, each of the 22 videotapes contains five
interviews, each reduced to a two-minute portion, with a
one-minute blank spot between them. These 22 new videotapes
were then copied to audiotape for vocal analysis. This
ensured that


both  types  of  nonverbal 


Student raters judged the visual nonverbal cues of the
interviewees. Definitions of the variables to be used in
this study (physical attractiveness, smiling, gaze, hand
movement, and body orientation) were taken from the
literature (Mehrabian, 1972; Parsons & Liden, 1984;
Rasmussen, 1984; Knapp & Hall, 1992). The author ran every
session and personally trained the raters for 30 minutes on
the use of seven-point scales to make ratings (Appendix B).
To boost reliability, five raters were used to rate each
manager. Each rated five managers; thus, 110 students
participated in this task.

After the audio portion of the videotape was
transferred to audiotape, the managers' voices were analyzed
for individual differences on voice characteristics. These
audio cues can be measured both subjectively and
objectively. Since computer-based methods of voice and
speech analysis have been demonstrated to be useful tools to
measure paralinguistic variables objectively (Scherer,
London, & Wolf, 1973), audio cues in this study were
measured with a computer voice analyzer. The same sections


of speech, spaced throughout the interview, were taken to be
reliable. Measurements were about 10 seconds long. The
software default is set at six seconds because that is
sufficient to sample running speech. However, sample time
was increased to 10 seconds to be certain to capture speech
that resembles normal conversational changes. Appendix C is
a sample audio cue file.


Definitions of Audio Cue Variables
Considering  that  objective  measurement  of  voice 
characteristics  is  relatively  unknown  outside  speech  and 
audiology  laboratories,  definitions  of  the  specific  voice 
characteristics are offered. Five variables are measured.
The first is pitch, the average fundamental frequency of the
voice. The habitual average pitch level for males is 128 Hz
and 225 Hz for females (Boone, 1977). Thus, pitch is
corrected for this physiological gender difference by
subtracting the difference (97 Hz) from the values for
females. Though habitual pitch levels vary according to the
individual and the prevailing circumstances (Greene &
Mathieson, 1989), all participants were interviewed under
the same circumstances, so variability can be attributed to
individual differences.


The  second  audio  variable  is  pitch  variability.  It  is 
the  standard  deviation  of  the  fundamental  frequency  within 
each  voice  sample.  In  this  study,  pitch  variability  is  the 
average  of  the  four  measurements'  standard  deviations.  This 
gives  an  estimate  of  the  pitch  variability,  or  fundamental 
frequency  range,  employed  by  the  speaker. 
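
As an illustration only, the two pitch measures defined above can be sketched in Python. The function names, the choice of the sample (n - 1) standard deviation, and the example readings are assumptions made for this sketch; they are not taken from the voice analyzer software used in the study.

```python
from statistics import mean, stdev

# Habitual pitch norms cited in the text (Boone, 1977):
# 225 Hz for females and 128 Hz for males, a difference of 97 Hz.
GENDER_PITCH_OFFSET_HZ = 225 - 128


def corrected_pitch(sample_means_hz, is_female):
    """Average F0 over a speaker's voice samples, minus 97 Hz for females."""
    pitch = mean(sample_means_hz)
    return pitch - GENDER_PITCH_OFFSET_HZ if is_female else pitch


def pitch_variability(samples_hz):
    """Mean of the within-sample standard deviations of F0.

    Assumption: the sample (n - 1) standard deviation is used; the text
    does not say which estimator the analyzer reports.
    """
    return mean(stdev(sample) for sample in samples_hz)


# Hypothetical F0 readings (Hz) for one female speaker, four voice samples.
samples = [[218, 226, 231], [222, 229, 224], [230, 221, 227], [225, 219, 228]]
sample_means = [mean(s) for s in samples]
print(round(corrected_pitch(sample_means, is_female=True), 1))
print(round(pitch_variability(samples), 2))
```

After the correction, a female speaker at the habitual female level and a male speaker at the habitual male level receive the same pitch score, so remaining differences reflect the individual rather than gender.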

Speech rate is a variable that reflects the length of
the pitch period, or how long the speaker holds a certain
level of pitch. When one is speaking, this period changes
when a new sound is uttered, such as a new syllable. Thus,
each speech rate measurement is the average pitch period (in
milliseconds), which can be used as a proxy for speech rate.
Because there was no script for the interviewees to follow,
each person uttered different words, making syllables per
second very difficult to measure. To check for relatedness,
ten interviewees' responses to the first question were
transcribed exactly and measured with both syllables per
second and the speech rate proxy described above. These two
measures correlated .88; thus, the proxy seems sufficient
both theoretically and practically.

The  fourth  variable  is  pauses.  This  variable  measures 
the  degree  of  unvoiced  time  in  the  sample,  averaged  over  all 
measurements.  It  takes  into  account  the  time  paused,  not 


just  the  number  of  pauses. 


Finally,  amplitude  variation  gives  an  evaluation  of  the 
variability  of  the  short-term  peak  to  peak  amplitude  within 
the  voice  sample.  The  computer  measures  the  volume  of  each 
utterance;  this  variable  shows  how  much  the  speaker 
involuntarily  varies  that  level  of  volume  in  the  sample, 
averaged  over  all  measurements. 

Measures  of  Personal  Reactions  and  Personality  Traits 

A laboratory study was conducted to gather ratings of
personal reactions to the managers and personality
attributions of the managers. One half of the students were
assigned to report personal reactions to the speakers and
the other half were assigned the task of making personality
attributions of the speakers. To make their ratings, they
were exposed to only one channel of information, either the
visual or the audio channel. Thus, the four conditions in
this experiment are: personal reactions-visual information,
personal reactions-audio information, personality
assessment-visual information, and personality
assessment-audio information.

To summarize this experiment, students either listened to a
portion of the interviews recorded on audiotape or viewed
silent portions of the videotape and provided ratings on
five managers as though they were subordinates. They rated
the managers on one of two criteria. One half of the raters
provided their perceptions of the managers on the Big 5
personality factors (extroversion, conscientiousness,
agreeableness, neuroticism, and culture) as though they were
subordinates. They were instructed on how to use the
20-item, seven-point polar adjectives scale (see Appendix D)
given by Goldberg (1992).

The  other  half  of  the  raters  provided  their  personal 
reactions  to  the  managers  as  though  they  were  subordinates. 
They  were  trained  to  use  seven  point  scales  for  the  seven 
different personal reactions. Each personal reaction was
measured with a two-item scale; thus, the entire personal
reaction scale contains 14 items (Appendix E). Each rater
had to rate five of the 110 managers from the entire range
of  performance  scores  on  either  personal  reactions  or 
personality  attributions  using  only  one  type  of  cue  or 
channel  of  information.  This  required  440  student  raters 
for  the  four  conditions. 

Conducted  in  this  manner,  the  impressions  of 
supervisory  traits  that  possibly  affect  the  behavior  of 
followers  were  measured  from  the  perspective  of 
subordinates.  The  validity  of  observer,  or  zero 
acquaintance  ratings  of  the  Big  5 personality  traits  has 
been established (Mount, Barrick, & Strauss, 1994), as has
the  reliability  of  other  observer  ratings  (Borkenau  & 


Correlational  analyses  were  undertaken  to  assess  the 
relationship  among  all  variables.  As  discussed  in  chapter 

variable.  Each  variable  was  standardized  before  combination 
to  avert  the  largest  mean/variance  variable  from  having 
undue  influence  on  the  index.  Again,  that  index  is: 

Audio Cue Index = Speech Rate + Voice Breaks + Range of
Pitch - Amplitude Variability - Pitch.

Similarly,  the  five  visual  cues,  the  seven  personal  reaction 
variables,  and  the  four  item  personality  scales  were 
respectively  combined  into  index  variables.  Because  they 
all  use  the  same  metric,  individual  variables  were  not 
standardized  before  combination.  The  visual  cues  are  an 
additive  scale,  personal  reactions  are  an  additive  scale, 
and  personality  attributions  are  four  item  scales  scored  in 
the  positive  direction  and  summed. 
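
The construction of these indexes can be sketched as follows. This is a minimal illustration under stated assumptions: the function names are hypothetical, the population standard deviation is assumed for standardizing (the text does not specify), and "Voice Breaks" and "Range of Pitch" in the index are assumed to correspond to the pauses and pitch variability variables defined earlier.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of scores to mean 0 and standard deviation 1."""
    m, s = mean(values), pstdev(values)
    return [(v - m) / s for v in values]


def audio_cue_index(speech_rate, voice_breaks, pitch_range, amp_var, pitch):
    """Signed sum of standardized cues, following the index given above."""
    z_sr, z_vb, z_pr, z_av, z_p = (
        zscores(col)
        for col in (speech_rate, voice_breaks, pitch_range, amp_var, pitch)
    )
    return [
        sr + vb + pr - av - p
        for sr, vb, pr, av, p in zip(z_sr, z_vb, z_pr, z_av, z_p)
    ]


def additive_index(*columns):
    """Unstandardized sum, as used for scales that share a metric."""
    return [sum(row) for row in zip(*columns)]


# Hypothetical scores for three managers on each audio cue.
index = audio_cue_index(
    speech_rate=[1, 2, 3],
    voice_breaks=[1, 2, 3],
    pitch_range=[1, 2, 3],
    amp_var=[3, 2, 1],
    pitch=[3, 2, 1],
)
print([round(v, 2) for v in index])
```

Standardizing first gives every cue the same weight in the sum, which is the stated reason for the step; the visual, personal reaction, and personality scales skip it because their items already share a seven-point metric.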

Part  One  of  this  study  tests  the  mediating  effects  of 
personal  reactions  and  personality  attributions  on  the 
relationship  between  nonverbal  cues,  interview  judgments  and 
ratings of performance. If this relationship shrinks when
the variance attributed to the Big 5 and personal reaction
variables is partialled out, it will be an indication of
their mediating effects. Regression analysis is used here
following Baron and Kenny (1986).
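
The logic of this test can be sketched with standardized regression coefficients computed from correlations. The sketch below is an illustration rather than the analysis actually run: the function names and data are hypothetical, and a single mediator is assumed for simplicity.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson product-moment correlation between two lists of scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def mediation_paths(cue, mediator, outcome):
    """Standardized paths for a single-mediator model.

    total  : cue -> outcome, ignoring the mediator
    a      : cue -> mediator
    b      : mediator -> outcome, controlling for the cue
    direct : cue -> outcome, controlling for the mediator
    """
    r_xy = pearson(cue, outcome)
    r_xm = pearson(cue, mediator)
    r_my = pearson(mediator, outcome)
    denom = 1 - r_xm ** 2
    return {
        "total": r_xy,
        "a": r_xm,
        "b": (r_my - r_xy * r_xm) / denom,
        "direct": (r_xy - r_my * r_xm) / denom,
    }


# Hypothetical scores for six managers.
cues = [1, 2, 3, 4, 5, 6]
reactions = [1, 3, 2, 5, 4, 6]
ratings = [2, 3, 3, 5, 4, 6]
paths = mediation_paths(cues, reactions, ratings)
print({name: round(value, 2) for name, value in paths.items()})
```

Mediation is indicated when the total and a paths are reliable and the direct path is clearly smaller than the total path; in standardized form the total effect equals the direct effect plus the product of a and b.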


A different analysis tool is used in Part Two of the
study. A Brunswikian lens model has been suggested as a
useful alternative research design for selection research
(Guion, 1990). By choosing the appropriate nonverbal cue
predictors, the lens model can help to achieve better
accuracy in the inference process of the interview (Gifford
et al., 1985). The lens model determines the degree of
agreement between the two sides of the model, given the
data in the center of the model (Figure 3-1).

First, agreement on the use of these nonverbal cues to
make interview judgments and job performance ratings can be
examined by using correlations between criterion performance
and nonverbal cues as one side of the model, and
correlations between interview judgments and nonverbal cues
as the other side of the model. That is, both dependent
variables will be regressed on the cues, saving the
predictions from each regression as a new variable. The
predictions from each regression equation can then be
correlated to estimate common usage of the information
items. This is analogous to a corrected validity coefficient
(Guion, 1990).
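
The procedure just described can be sketched as two ordinary least squares regressions on the same cues, followed by a correlation between the two sets of fitted values. The helper names and data below are hypothetical, and the small linear solver is only there to keep the sketch self-contained.

```python
from math import sqrt
from statistics import mean


def pearson(x, y):
    """Pearson product-moment correlation between two lists of scores."""
    mx, my = mean(x), mean(y)
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)


def solve(a, b):
    """Solve a @ x = b by Gauss-Jordan elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [b[i]] for i, row in enumerate(a)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                factor = m[r][col] / m[col][col]
                m[r] = [v - factor * p for v, p in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def ols_predictions(cue_columns, y):
    """Fitted values from regressing y on the cues (intercept included)."""
    rows = [[1.0] + list(r) for r in zip(*cue_columns)]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)] for i in range(k)]
    xty = [sum(r[i] * yi for r, yi in zip(rows, y)) for i in range(k)]
    beta = solve(xtx, xty)
    return [sum(b * v for b, v in zip(beta, r)) for r in rows]


def cue_usage_agreement(cue_columns, interview, performance):
    """Correlation between the two sets of cue-based predictions."""
    return pearson(
        ols_predictions(cue_columns, interview),
        ols_predictions(cue_columns, performance),
    )


# Hypothetical data: two cues, six managers.
gaze = [1, 2, 3, 4, 5, 6]
smiling = [2, 1, 4, 3, 6, 5]
interview = [4, 6, 9, 12, 15, 18]
performance = [3, 4, 8, 7, 12, 11]
print(round(cue_usage_agreement([gaze, smiling], interview, performance), 2))
```

A value near 1 would suggest that interview raters and job performance raters weight the cues similarly; a value near 0 would suggest that the two groups use the cues differently.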

These analyses move toward understanding the effect of
nonverbal cues on interview validity. Another way to look
at this problem is through partial correlations and
corresponding variance reduction rates (Chen & Spector,
1991). This method is used to examine validity lost when
the effects of the cues are controlled.
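
A first-order partial correlation of this kind can be sketched as below. The partial correlation formula is the standard one; the variance reduction rate is written here as the proportional drop in shared variance, which is an assumption about the computation rather than a formula given in the text, and the input correlations are hypothetical.

```python
from math import sqrt


def partial_correlation(r_xy, r_xz, r_yz):
    """Correlation between x and y with z held constant."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))


def variance_reduction_rate(r_xy, r_partial):
    """Proportion of shared variance lost once the control is applied.

    Assumption: defined as (r^2 - partial r^2) / r^2.
    """
    return (r_xy ** 2 - r_partial ** 2) / r_xy ** 2


# Hypothetical correlations: interview-performance validity, and each
# rating's correlation with a nonverbal cue index.
validity = 0.50
r_interview_cues = 0.50
r_performance_cues = 0.50

controlled = partial_correlation(validity, r_interview_cues, r_performance_cues)
print(round(controlled, 3))
print(round(variance_reduction_rate(validity, controlled), 3))
```

The larger the reduction, the more of the interview's validity can be attributed to variance that both ratings share with the cues.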


[Figure 3-1. Modified Brunswik lens model. Legend: R =
multiple correlation; subscript c = predictability of
criterion; subscript j = predictability of judgments.]


CHAPTER  4 
RESULTS 

Measurement Reliability

Considering  the  data  come  from  different  sources, 
including  ratings,  reliability  must  be  dealt  with  first  to 
determine  the  stability  of  the  different  measurements.  Once 
reliability  is  established,  relationships  can  be  examined 
with  confidence. 


Table 4-1 shows the correlation matrix for the audio
nonverbal cues. Though pausing is highly correlated with
amplitude variability (r = .44, p < .05), these variables do
not measure the same construct. Amplitude variability
measures the short-term variation in loudness a speaker
uses, while pausing reflects the average amount of time the
speaker made no utterances. Likewise, the correlation
between speech rate and amplitude variability (r = -.68,
p < .05) shows a true relationship that is unclouded by
artifacts. Reliability for these variables is calculated as
the average correlation of the four measurements taken,
corrected by the Spearman-Brown formula. Estimates range
from .62 for pitch variability to .93 for pitch. Pitch
variability can reasonably be expected to be lower since a
speaker can vary pitch across measurements much more easily
than the other vocal variables. Important for this study is
the reliability of the audio cue index (α = .82). It is
sufficiently high to use as an index of vocal
attractiveness.
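
For readers unfamiliar with the correction, the Spearman-Brown formula steps up the reliability of a single measurement (or rater, or item) to the reliability of a composite of k of them. The sketch below uses a hypothetical average correlation, not a value from this study.

```python
def spearman_brown(r, k):
    """Reliability of a composite of k parallel parts with average correlation r."""
    return k * r / (1 + (k - 1) * r)


# Hypothetical example: four measurements that intercorrelate .50 on average.
print(round(spearman_brown(0.50, 4), 2))
```

The same function covers the other uses in this chapter by changing k: two for the two-item personal reaction scales and five for the five raters of each manager.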

Visual Nonverbal Cues

Correlations among the five visual nonverbal cues are
shown in Table 4-2. Here, all variables correlate fairly
highly, ranging from .30 to .47, though not so highly that
one might think they measure the same construct. Raters
were thoroughly trained in this aspect of the study to tally
each variable separately. Reliability here is calculated as
the intraclass reliability of five raters corrected by the
Spearman-Brown formula. Again, reliability is good, ranging
from .77 to .90, and is very high on the index (α = .93).
Thus, a visual cue index also can be used to


Personality  Judgments 


Personality of the interviewees was assessed by trained
student raters in a controlled laboratory experiment. Thus,
two types of reliability need to be assessed: scale and
rater. First, scale reliability was computed by calculating
coefficient alpha for each of the Big Five personality
factors (Table 4-3). Each factor was assessed with a
four-item scale. Reliability estimates range from .58 to
.78 in the visual cue condition and from .45 to .78 in the
audio cue condition. Considering the raters had only one
channel of information from which to make their personality
ratings, these somewhat low reliability estimates are
expected. The lowest reliability is for agreeableness, the
factor that the zero-acquaintance literature has shown to be
the most difficult for raters to agree upon (e.g., Borkenau
& Liebler, 1992). Reliability for the other two
organizationally relevant factors, extroversion and
conscientiousness, is sufficiently high.

Rater  reliabilities  for  the  personality  factors  are 
similar  to  those  above,  ranging  from  .45  to  .78  in  the 
visual  cue  condition  and  from  .37  to  .80  in  the  audio  cue 
condition  (Table  4-4).  These  are  calculated  using  the 


BIG 5 PERSONALITY SCALE RELIABILITY

                                 Condition
Factor               Visual Cues Only    Audio Cues Only

Extroversion               .78
Conscientiousness          .65
Agreeableness              .59
Neuroticism                .58
Culture                    .63

Coefficient alpha; 4 items per scale; n = 550




intraclass  reliability  of  the  five  raters,  corrected  by  the 
Spearman-Brown  formula.  Correlations  among  the  audio 
attributions  of  the  factors  are  given  below  the  diagonal  in 
Table  4-4  and  among  the  visual  attributions  of  the 
personality  factors  above  the  diagonal.  The  correlations 
between  agreeableness  and  the  other  factors,  along  with  the 
intraclass  reliability  estimate  for  agreeableness,  support 
the  contention  made  above  that  this  construct  is  difficult 
to agree upon from an observer's point of view. Interrater
reliability is high for extroversion (α = [...] in the visual
condition and α = .80 in the audio condition) and fairly high
for conscientiousness (α = .65 in the visual condition and
α = .45 in the audio condition).

Personal  Reactions 

Both  types  of  reliability  are  important  with  regard  to 
the  measurement  of  the  affective,  personal  reactions  toward 
interviewees.  Each  of  the  seven  variables  was  measured  with 
a two-item  scale.  Internal  consistency  ranges  from  .80  to 
.91  in  the  visual  cue  condition  and  ranges  from  .86  to  .96 
in the audio cue condition (Table 4-5). Internal
consistency  reliability  is  calculated  using  the  correlation 
between  the  two  items  measuring  each  construct  and 
correcting  it  using  the  Spearman-Brown  formula. 

Correlations  between  items  in  the  audio  condition  are  in  the 
first  column  with  adjusted  reliability  estimates  in  the 
second  column.  Correlations  between  items  in  the  visual 
condition  are  in  the  third  column  with  corrected  reliability 
estimates  in  the  last  column. 

Reliability of the index resulting from combining these
two-item scales is shown in Table 4-6. This estimate is the
intraclass reliability of the 14-item scale adjusted by the
Spearman-Brown formula for five raters.


PERSONAL REACTIONS INTER-ITEM CORRELATIONS AND
RELIABILITY ESTIMATES

[Table values not recoverable. Note: internal consistency
reliability adjusted by the Spearman-Brown formula.]


PERSONAL REACTIONS RELIABILITY

[Table values not recoverable.]

Intraclass reliability adjusted by Spearman-Brown formula
for 5 raters; 14 items per scale; n = 110


The intraclass correlation for the five raters of each
interviewee, corrected by the Spearman-Brown formula,
yielded reliability estimates from .66 for Trustability to
.74 for Competence in the visual cue condition. In the
audio cue condition, the reliability estimates range from
.49 for Helpfulness to .76 for Competence (Table 4-7).

When the personal reactions are combined in an index,
very high reliability is observed (α = .93 for the visual
condition and α = .96 for the audio condition). Therefore,
an index can be used to represent overall personal reactions
to the managers from the subordinate's point of view.

Relationships Between Cues and Interview Judgments

The first set of hypotheses predicts a positive
relationship between visual cues and interview ratings, and

INTRACLASS CORRELATIONS FOR PERSONAL REACTION VARIABLES

                                    Condition
Variable                  Visual Cues Only    Audio Cues Only

Competence                      .74
Compliance                      .71
Dominance                       .70
Helpfulness                     .71
Likability                      .70
Persuasiveness                  .73
Trustability                    .66
Personal Reaction Index         .80

Intraclass reliability adjusted by Spearman-Brown formula


visual  cues  and  job  performance  ratings.  For  visual  cues 
and  interview  ratings,  the  correlation  is  .21  (p  < .05)  and 
for  visual  cues  and  job  performance  ratings,  the  correlation 
is  .14  (p  = .069)  (Table  4-8).  These  relationships  show 
that  visual  cues  are  taken  into  account  when  determining 
interview  ratings  and  job  performance  ratings. 

The next hypotheses predict positive relationships
between audio cues and interview ratings and between audio
cues and job performance ratings. For audio cues and
interview ratings, the correlation is [...] (p < .05), and
for audio cues and job performance ratings, the correlation
is .20 (p < .05) (Table 4-8). These relationships reveal
the same information as for visual cues: audio cues are
taken into account when determining interview ratings and
job performance ratings.

Hypotheses 3a and 3b predict a positive relationship
between personal reactions and visual cues and audio cues,
respectively. These too were supported, robustly for visual
cues (r = .59, p < .05) and also for audio cues (r = .20,
p < .05) (Table 4-8). Considering that the raters were
exposed to only one channel of information when making
their ratings, these findings show that people react in a
personal way to the level of specific nonverbal cue usage of
others.

Similarly, positive relationships between the cues and
personality attributions were expected. For visual cues,
all three organizationally relevant Big 5 traits are
significantly related (extroversion: r = .40, p < .05;
conscientiousness: r = .24, p < .05; agreeableness: r = .15,
p = .061). For audio cues, however, only extroversion was
significant (r = .31, p < .05). Conscientiousness had a


CORRELATIONS
CONDITION

A) Audio Condition

                        Cues
Personal Reactions      .20*
Agreeableness          -.01
Conscientiousness       .12
Extroversion            .31*

                        Interview    Performance
                        Ratings      Ratings
                          .32*         .20*
                          .34*         .16*

B) Visual Condition

                        Interview    Performance
                        Ratings      Ratings
Visual Cues
Personal Reactions
Agreeableness
Conscientiousness
Extroversion

N = 110


(Table 4-8). Thus, personality traits are more
easily attributed with the use of visual cues, except the
trait of extroversion, which can be attributed by using
either channel of information.

The  relationships  established  above  can  now  be  tested 
with  the  theoretical  model  given  in  Figure  2-1.  Following 
Baron  and  Kenny  (1986),  this  mediating  model  is  tested  in 
three  steps.  First,  there  must  be  a significant 
relationship  between  the  independent  variable  and  the 
dependent  variable.  Second,  there  must  be  a significant 
relationship  between  the  mediator  and  the  independent 
variable. Finally, when the combined effects of the
mediator and the independent variable are examined in a
linear regression equation, the mediator must still be
statistically significant and the independent variable must
have a weaker relationship with the dependent variable than
when examined individually. Perfect mediation exists when
the  relationship  between  the  independent  and  the  dependent 
variable  is  driven  downward  to  zero  when  the  mediator  is 
introduced  into  the  equation  in  step  3. 
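
The three steps above can be sketched in a few lines. This is an illustration under stated assumptions, not the analysis code used in the study: with one independent variable X and one mediator M, the standardized coefficients of the combined equation follow from the three pairwise correlations, so no raw data are needed. The function names are hypothetical, the mediator-outcome correlation in the example is an invented value, and significance testing (which each step also requires) is omitted.

```python
def combined_effects(r_xy, r_xm, r_my):
    """Standardized betas for regressing Y on X and M together,
    computed from the three pairwise correlations."""
    denom = 1.0 - r_xm ** 2
    if denom <= 0:
        raise ValueError("X and M must not be perfectly correlated")
    beta_x = (r_xy - r_my * r_xm) / denom
    beta_m = (r_my - r_xy * r_xm) / denom
    return beta_x, beta_m

def mediation_summary(r_xy, r_xm, r_my):
    """Collect the quantities inspected in the three steps."""
    beta_x, beta_m = combined_effects(r_xy, r_xm, r_my)
    return {
        "step1_x_to_y": r_xy,          # X must relate to Y
        "step2_x_to_m": r_xm,          # X must relate to M
        "step3_beta_x": beta_x,        # should shrink toward zero
        "step3_beta_m": beta_m,        # should remain significant
        "x_effect_reduced": abs(beta_x) < abs(r_xy),
    }

# r_xy = .21 and r_xm = .59 are the visual cue correlations
# reported in the text; r_my = .30 is a hypothetical value.
result = mediation_summary(r_xy=0.21, r_xm=0.59, r_my=0.30)
```

Perfect mediation corresponds to `step3_beta_x` being zero, and partial mediation to a coefficient that is smaller than the step 1 correlation but still nonzero.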

Hypothesis  set  5 predicts  the  mediating  effects  seen  in 
Figure  2-1  on  the  relationship  between  visual  cues  and 
interview  ratings.  Table  4-9  shows  that  when  the  combined 
effects  of  personal  reactions  as  the  mediator  and  visual 


cues as the independent variable are examined on interview
ratings (H5a), the mediator is significant and the
independent variable is driven toward zero (β_pr = .21, p <
.05; β_vc = .08, ns). Personal reactions strongly mediate
the  relationship  between  visual  cues  and  interview  ratings, 
given  the  nonsignificant  relationship  of  the  visual  cues  on 
the  dependent  variable  when  the  mediator  is  added  to  the 
equation. 

Hypothesis 5b was not supported (Table 4-9). The
mediator extroversion was not significant in the combined
effects regression equation (β_ex = .01, ns; β_vc = .20, p <
.05). H5c predicted that conscientiousness would mediate the
visual cue - interview rating relationship, and it was
supported (β_con = .21, p < .05; β_vc = .14, ns). Finally, H5d
was not supported. Agreeableness attributions made from the
visual channel do not mediate the relationship between
visual cues and interview ratings (β_ag = -.03, ns; β_vc = .21,
p < .05).

The next set of hypotheses test the model from the
audio cue side (see Table 4-10). First, personal reactions


the  speaker  derived  from 




rating  relationship  to  confirm 


feels  affectively  toward  a manager,  given  only  audio  cue 
information  about  the  manager,  partially  explains  the 
relationship  between  audio  cues  and  interview  ratings. 

Similarly, extroversion mediates this relationship (β_ex
= .26, p < .05; β_ac = .24, p < .05). The degree to which a
manager is attributed the trait extroversion is a causal
link between audio cues and interview ratings.
Conscientiousness also was found to mediate the audio cue
and interview rating relationship (β_con = .39, p < .05; β_ac =

however,  where  the  individual  effects  of  the  mediator  are 
not  significantly  related  to  the  cues,  but  still  mediate  the 
relationship  in  the  combined  effects  model. 

Testing  for  agreeableness  as  a mediator  did  not  pass 
the  conditions  set  out  in  step  2,  i.e.,  the  relationship 
between  agreeableness  and  audio  cues  was  not  significant. 
Here  though,  agreeableness  was  not  significant  in  the 
combined effects model, and thus, cannot be a mediator (β_ag

To  this  point,  the  analyses  discussed  have  tested  the 
mediating  effects  on  both  types  of  cues  and  interview 
ratings.  These  mediator  analyses  are  repeated  with  job 
performance  ratings  as  the  dependent  variable.  First, 




hypothesis set H7 examines the mediating effects on the
visual cue - job performance relationship. Table 4-11
provides evidence that none of the hypotheses involving the
visual cue - job performance rating relationship are
supported.

For the audio cue - job performance hypotheses, only
H8a held (see Table 4-12). The relationship between job
performance ratings and audio cues can be partially
explained by personal reactions (β_pr = .23, p < .05; β_ac =
.16, ns). None of the other hypotheses were supported.

This  completes  the  analyses  from  Part  One  of  the  study. 

Research  Model  Testing:  Part  Two 
In  the  first  section  of  Part  Two  of  the  study,  the 
effects  of  nonverbal  cues  on  the  interview  validity  and 
audio  cues  respectively  are  partially  the  underlying  cause 
of  the  validity  coefficient  for  the  interview  ratings  and 
job  performance.  Both  hypotheses  are  supported  to  an 
extent,  though  audio  cues  explain  more  of  the  variance  in 
the validity coefficient in this study (Table 4-13). Though
the  validity  coefficient  only  drops  .03  from  the  zero  order 
to  first  order  correlation,  this  corresponds  to  an  Hi  drop 
in variance accounted for by visual cues. Moreover, when
the effects of audio cues are partialled out of the


MEDIATION RESULTS FOR VISUAL CUES AND JOB PERFORMANCE
STANDARDIZED REGRESSION COEFFICIENTS


ZERO AND FIRST ORDER CORRELATIONS BETWEEN INTERVIEW RATINGS
AND PERFORMANCE RATINGS


interview rating - job performance rating relationship, 27%
of the variance is reduced. While both types of cues are
partial causes of the relationship between interview ratings
and job performance ratings, audio cues account for more
than a one-quarter reduction in variance when partialled
out. Finally, controlling for both types of cues reduces the
variance explained by 50%.
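
The arithmetic behind these variance reductions can be sketched as follows. The sketch assumes the standard first-order partial correlation formula and assumes that "variance reduction" means the proportional drop in the squared validity coefficient; the function names are hypothetical, and the example makes no claim to reproduce the tabled values.

```python
from math import sqrt

def partial_corr(r_xy, r_xz, r_yz):
    """First-order partial correlation of X and Y controlling for Z."""
    return (r_xy - r_xz * r_yz) / sqrt((1 - r_xz ** 2) * (1 - r_yz ** 2))

def variance_reduction(r_zero, r_first):
    """Proportional drop in variance explained (r squared) when the
    zero-order correlation falls to the first-order correlation."""
    return 1 - (r_first ** 2) / (r_zero ** 2)

# The validity coefficient reported in the text is .35; a drop of .03
# gives a first-order correlation of .32.
drop = variance_reduction(0.35, 0.32)   # roughly 0.16
```

Because variance explained is the square of the correlation, a small absolute drop in the validity coefficient corresponds to a proportionally larger drop in variance accounted for.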

In  the  second  section  of  Part  Two,  a Brunswik  lens 
model  is  used  to  assess  the  degree  to  which  interview  raters 
and  job  performance  raters  similarly  use  different 


constructs when making their ratings. To examine Question
1, both job performance ratings and interview judgments were
regressed on the individual visual cues. Predictions
made by these regression models are saved as variables and
correlated to give the "matching index." The matching index
is understood as the degree to which both sides of the model
similarly use the information items. Figure 4-1 shows the
results of this analysis. The matching index, or the
corrected validity coefficient (Guion, 1990), is r = .56.

This shows the maximum job relatedness of the interview
judgments given only visual cues. It is interesting that
the multiple correlation for the predictability of the
criterion given only these visual cues (R = .22) is close to
the validity coefficient (r = .35). A similar result is
found on the interview rating side of the model (R = .18).
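
The matching index computation described above can be sketched as follows. This is a minimal illustration, not the procedure actually run for the study: it assumes ordinary least squares with an intercept, solves the normal equations directly (so the cues must not be perfectly collinear), and uses hypothetical function names and toy data.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / sqrt(sxx * syy)

def solve(a, b):
    """Solve the linear system a w = b by Gaussian elimination
    with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    w = [0.0] * n
    for r in range(n - 1, -1, -1):
        s = sum(m[r][c] * w[c] for c in range(r + 1, n))
        w[r] = (m[r][n] - s) / m[r][r]
    return w

def ols_predict(cues, y):
    """Regress y on the cues (plus intercept); return fitted values."""
    rows = [[1.0] + list(c) for c in cues]
    k = len(rows[0])
    xtx = [[sum(r[i] * r[j] for r in rows) for j in range(k)]
           for i in range(k)]
    xty = [sum(r[i] * t for r, t in zip(rows, y)) for i in range(k)]
    w = solve(xtx, xty)
    return [sum(wi * ri for wi, ri in zip(w, r)) for r in rows]

def lens_model(cues, judgments, criterion):
    """Brunswik lens model statistics for one set of cues."""
    pred_j = ols_predict(cues, judgments)
    pred_c = ols_predict(cues, criterion)
    return {
        "matching_index": pearson(pred_j, pred_c),
        "R_judgments": pearson(pred_j, judgments),
        "R_criterion": pearson(pred_c, criterion),
        "validity": pearson(judgments, criterion),
    }

# Hypothetical data: two cues per manager, six managers.
cues = [(1.0, 2.0), (2.0, 1.0), (3.0, 5.0),
        (4.0, 3.0), (5.0, 8.0), (6.0, 4.0)]
interview = [2.1, 2.4, 3.9, 3.8, 5.6, 4.9]
performance = [3.0, 2.2, 4.1, 3.5, 5.0, 5.2]
result = lens_model(cues, interview, performance)
```

Each multiple correlation R is the correlation between a set of ratings and its own fitted values, while the matching index correlates the two sets of fitted values with each other; the matching index can therefore exceed the observed validity coefficient when both raters weight the cues alike but each rating is only weakly predictable from the cues.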

Figure  4-2  has  the  results  for  Question  2 regarding 
audio  cues.  The  matching  index  is  much  smaller  here,  r = 
.28,  showing  significant  similarity  of  cue  usage,  but  not  as 
much  as  the  visual  cues.  In  fact,  this  matching  index  is 
not  even  as  high  as  the  validity  coefficient.  One  could 
argue  that  the  combined  usage  of  audio  cues  by  both  types  of 
raters  is  suppressing  the  validity  coefficient.  However, 
similar  to  visual  cues,  the  multiple  correlation  for  the 
predictability of the criterion given audio cues (R = .27)
is very close to the validity coefficient (r = .35) as is
the multiple correlation for the predictability of the
interview judgments (R = .33).

Question  3 is  examined  in  Figure  4-3.  When  the 
personality  attributions  made  from  both  channels 
individually  are  combined  to  reflect  overall  personality 
attribution,  the  matching  index  is  very  high  (r  = .81). 

This  suggests  that  interview  raters  and  job  performance 
raters  almost  equally  use  personality  attributions  when 
making  their  ratings.  The  multiple  correlations  indicate 
good predictability of both the criterion (R = .27) and
interview judgments (R = .54) using personality
attributions.

The last question in Part Two examines the use of
personal reactions by both types of raters. Figure 4-4
shows a similarly high matching index as for personality
attributions (r = .75). Again, this indicates equal usage
of personal reactions when making different types of
ratings. It also shows higher validity for the personal

validity  coefficient  from  the  interview.  Using  personal 
reactions  to  nonverbal  cues,  one  can  predict  job  performance 
(R  = .28)  and  interview  judgments  (R  = .44)  very  well.  The 

as  described  above  in  personality  attributions. 


Figure legend: r = validity coefficient (.35); r = matching
index; R = multiple correlation for predictability of
judgments; R = multiple correlation for predictability of
criterion.

FIGURE 4-3
PERSONALITY JUDGMENTS FROM NONVERBAL CUES

FIGURE 4-4
PERSONAL REACTIONS FROM NONVERBAL CUES


DISCUSSION 


This  study  provides  evidence  that  there  is  a 
meaningful  relationship  between  both  visual  and  audio 
nonverbal  cues,  interview  favorability,  and  job  performance 
ratings.  These  relationships  are  shown  to  be  greatly 
mediated  by  personal  reactions  to  the  speaker,  and  to  a 
lesser  extent,  by  personality  attributes  of  the  speaker. 

Interview Favorability

In  an  interview,  there  is  little  time  to  put  together 
all  information  at  hand  to  derive  subsequent  ratings  of 
favorability. Thus, raters are left to search for ways to
assess the ratee using not only the behavioral information
given in the interview answers, but also other cues from
both the visual and audio channels.

The data examined in this study indicate that subordinates
react  in  a personal  manner  to  the  nonverbal  cues  of 
managers.  When  subordinates  are  exposed  to  only  the  visual 
channel  of  information  from  their  manager,  their  personal 
reactions  to  what  they  see  act  as  the  mechanism  through 
which  the  cues  cause  interview  favorability.  In  other 
words,  the  correlation  between  visual  cues  and  interview 


ratings may not be attributable to stereotyping error if the
affective orientation that listeners have toward the speaker
indicates how one would react on the job. In this study, the
listener  was  trained  as  the  subordinate  to  the  speaker,  who 
was  the  manager.  These  results  suggest  that  favorable 
ratings  in  the  interview  that  are  associated  with  visual 
cues  would  translate  to  the  job  where  the  subordinate  would 
react  more  positively  to  the  manager,  given  higher  levels  of 

Conscientiousness  also  was  found  to  mediate  the 
relationship  between  visual  cues  and  interview  ratings.  The 
correlation  between  visual  cues  and  interview  ratings  is 
explained  when  a manager  is  deemed  responsible  and 
dependable  by  a listener.  This  has  even  greater 
ramifications  for  firms  since  conscientiousness  is  a job 
relevant construct (e.g., Barrick & Mount, 1991; Hough et
al.,  1990). 

When  listeners  are  exposed  to  only  the  audio  channel  of 
information  of  the  speaker,  their  personal  reactions 
partially  mediate  the  relationship  between  audio  cues  and 
interview  ratings.  Though  not  as  strong  an  effect  as  found 
with  the  visual  cues,  the  way  that  one  reacts  to  what  is 
heard,  without  seeing  the  speaker,  is  a mechanism  for  this 
linkage.  This  suggests  that  the  way  a speaker's  voice 
sounds  acts  in  a similar  manner  to  how  the  speaker  looks. 


If this represents stereotyping, it has organizational
relevance  for  effective  performance.  Similar  to  the  visual 
cues,  if  a subordinate  finds  the  manager's  voice  attractive, 
that  subordinate  reacts  in  a more  positive  affective  manner 
to  the  manager.  Stated  another  way,  subordinates  react 
systematically  to  audio  cues,  making  it  reasonable  to 
believe that people who possess optimum cues will be more
effective as managers.

Extroversion  also  was  shown  to  be  a causal  link  between 
audio  cues  and  interview  ratings.  If  extroversion  is 
attributed  to  the  manager,  that  manager's  audio  cues  reflect 
it  and  higher  interview  ratings  result.  It  can  be  argued 
that  in  the  interview,  an  extroverted  person  is  at  a 
significant advantage over an introverted person because
impression  management  is  more  deftly  used  when  one  is  an 
extrovert.  From  this  study,  it  also  can  be  argued  that 
since  extroversion  is  attributed  solely  from  audio  cues, 
cues  that  are  difficult  to  change,  these  attributions  are 
indicating  an  internal  trait,  not  impression  management 

This  work  also  shows  a significant  correlation  between 
visual  and  audio  nonverbal  cues  and  job  performance  ratings. 
With  interview  ratings,  one  explanation  was  stereotyping, 
given  an  interviewer's  limited  information.  Impression 


management  is  also  an  explanation  for  the  relationship 
between  nonverbal  cues  and  interview  ratings.  These 
explanations  cannot  hold  for  job  performance  ratings. 
Supervisors have access to much more information about the
ratee, making the issues of stereotyping from limited
information and the use of ratee impression management in
this rating setting moot. It is a more complex task to
explain  nonverbal  cues  correlating  with  job  performance. 

Results from this study show that personal reactions
to audio cues are a mechanism for the audio cue/job
performance rating link. Managers' audio cues are related
to higher ratings of their job performance because those
cues cause subordinates to help them more, trust them more,
comply with them more, and have overall more favorable
affective personal reactions to the manager. Subordinates
respond to
certain  audio  cues  more  positively  than  others.  A high- 
pitched  voice,  one  that  uses  little  variation  in  pitch, 
hesitates  infrequently,  does  not  vary  the  amplitude,  and  is 
slow  paced,  is  not  associated  with  effectiveness.  But  since 
these  cues  cause  subordinates  to  be  more  supportive  of  their 
managers,  job  performance  of  the  manager  increases. 
Essentially,  this  raises  the  possibility  that  these  cues  may 
belong  in  the  performance  domain. 


Interview  Validity 

In this study, a significant amount of variance in the
validity  coefficient  was  explained  by  nonverbal  cues, 

cues  are  partialled  out  together,  half  the  variance 
explained  in  the  validity  coefficient  is  lost.  That  is, 
nonverbal  cues  explain  half  the  variance  in  the  interview 
validity.  This  suggests  the  cues  belong  in  the  performance 
domain.  If  concerns  associated  with  interview  ratings  and 
nonverbal  cues  can  be  allayed,  making  this  suggestion  might 
be more palatable. After all, if nonverbal cues belong in
the job performance criterion domain, what could ultimately
be suggested is to use nonverbal cues in organizational
decisions.

The  analysis  from  the  Brunswik  lens  model  provides 
different  information.  The  two  sides  of  the  model  showed 
agreement  on  the  usage  of  nonverbal  cues,  personal 
reactions,  and  personality  attributions.  To  the  extent  of 
the  varying  matching  indices  from  the  Brunswik  lens  model 
analysis,  there  is  evidence  that  these  different  constructs 
are  utilized  when  assessing  the  performance  of  others.  This 
finding moves toward offering a partial rebuttal to the
notion  that  nonverbal  cues  are  error  in  interview  judgments. 
If  the  cues  are  used  similarly  in  job  performance  rating 


attributed 


suggested  in  interviews,  there  is  further  support  for  the 
notion  that  the  cues  indicate  underlying  traits  that  are 
important  on  the  job. 

Conclusion 

The effects of managers' nonverbal cues on their job
performance  ratings  and  interview  ratings  may  be  explained 
in  a couple  of  ways.  Perhaps  there  are  shared  biases  from 
supervisor  to  interview  rater.  These  would  be  things  that 
society  values  or  holds  biases  toward.  An  example  concerns 
decisions  about  political  candidates.  Research  has  shown 
these  decisions  often  are  made  on  the  basis  of  appearance  or 
a "general feeling" rather than content (Brownlow, 1992).

Another  interpretation  is  that  the  cues  are  behavioral 
manifestations  of  underlying  traits.  The  results  from  this 
study  lean  toward  this  conclusion.  Interviews  may  have 
stereotyping  and  impression  management,  but  job  performance 
ratings  should  not.  The  degree  of  similar  usage  of  the  cues 
shows  they  are  revealing  something  other  than  stereotyping 
and  impression  management.  Job  performance  ratings  have 
only  one  explanation  from  this  study:  personal  reactions 

Limitations  to  this  study  can  be  overcome  with  replication 
and  expansion  of  the  research  model  used  in  this  study.  It 
could  contain  other  mechanisms  through  which  the 


relationship between nonverbal cues, interview favorability,
and  job  performance  ratings  can  be  examined. 




Latham, G.P. (1989). The reliability, validity, and
practicality of the situational interview.


Psychology.  65. 


ilsal . Academy 


Macdonald, T., & Hakel, M.D. (1985). Effects of applicant

questioning strategy and ratings. Personnel
Psychology, 38, 321-334.


b applicant  responses. 
2A,  72-80. 


Springfie 


Rousey, C.L., & Mitchell, K. (1974). Selection of applicants

Psychiatric  assessment  by  speech  and  hearing  behavior. 
Springfield,  IL:  Charles  C.  Thomas. 

Sackett, P.R. (1982). The interviewer as hypothesis tester:
The effects of impressions of an applicant on
interviewer questioning strategy. Personnel
Psychology, 35, 789-803.


Scherer, K.R. (1979). Personality markers in speech. In
K.R. Scherer & H. Giles (Eds.), Social markers in
speech (pp. 58-79). Cambridge: Cambridge University


APPENDIX 


The  following  six  pages  contain  the  six  behaviorally 
anchored  scales  used  by  both  the  interview  raters  and  the 
supervisors. 


VISUAL  NONVERBAL 


EXPERIENCE  MANAGEMENT  EXPERIENCE 


interview? 


APPENDIX  D 

PERSONALITY  ATTRIBUTION  SCALES 
Adjectives  describing  personality  traits  appear  on  the  lef 
point  scale  running  between  them.  Please  describe  your 
impressions  of  the  previous  person  by  circling  one  number 
pair  of  opposites. 

(Scale  is  directly  below  on  actual  instrument,  but  it  is 


Talkative 
Not  Jealous 

Anxious 


Negativistic 
Quitting,  Fickle 

Unreflexive, 


Jealous 


Cautious 

Persevering 

Intellectual 


Irritable 

Crude,  Boorish 
Headstrong 

Simple,  Direct 


Hypochondriacal 
7 Reclusive 


PERSONAL  REACTION  SCALE 

How trustworthy do you feel this person is as your manager?


t trustworthy 


never  comply  neutral  always  comply 

extremely  dislike  neutral  extremely  like 

4 . The  degree  to  which  this  person  could  persuade  me  to  do  things 


average  extremely  di 


extremely  competent 


PERSONAL  REACTION  SI 
s person  was  my  boss,  I w 


strongly  disagree 


BIOGRAPHICAL  SKETCH 

Timothy Gerald DeGroot was born February 14, 1960, in
Windsor,  Ontario,  Canada.  He  worked  in  retail  for  over  ten  years 

married  to  Bonnie  for  15  years.  Together,  they  have  four 
children:  Alison,  Joyce,  Mackenzie,  and  Kellen.  He  has  accepted 

in  Washington,  D.C. 




